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ABSTRACT 

A population  of  Items  which  break  down  at  random  times  and  require  repair 
is  studied  (the  classic  "machine  repair  problem  with  spares”).  It  is  desired  to 
determine  the  number  of  repair  chimnels  and  spares  required  over  a multiyear 
planning  horison  in  which  population  sise  and  component  reliability  varies,  and  a 
service  level  constraint  is  imposed.  When  an  item  fails,  a spare  (if  available)  is 
immediately  diq>atched  to  replace  the  failed  item.  The  failed  item  is  removed, 
transported  to  the  repair  depot,  repaired,  and  then  placed  in  the  spares  pool 
(which  is  constrained  to  be  empty  not  more  than  10%  of  the  time)  unless  there  is 
a backlog  of  requests  for  spares,  in  which  case  it  is  dispatched  immediately.  The 
first  model  conddered  treats  removal,  transportation,  and  repair  as  one  service 
operation.  The  second  model  is  a series  queue  which  allows  for  the  separate  treat- 
ment of  removal,  transportation,  and  repair.  Breakdowns  are  assumed  Poisson 
and  repair  times  exponential. 

1.  INTRODUCTION 

One  of  the  earliest  applications  of  queueing  theory  to  spares  provisioning  problems  was  the 
work  of  Taylor  and  Jackson  [12],  in  which  a finite  source  queueing  model  with  spares  was  used 
to  determine  the  number  of  spare  engines  required  to  maintain  a fleet  of  aircraft  at  a certain  efficiency 
level.  A bibliography  of  some  recent  work  on  queueing  approaches  to  provisioning  type  problems 
is  given  in  Lureau  [6].  Of  particular  interest  are  Refs.  [2],  [8],  and  [9].  The  problem  treated  in  this 
paper  is  the  determination  of  an  adequate  number  of  spares  and  repair  lines  (servers)  for  replacing 
and  repairing  components  which  randomly  fail,  Msnming  that  the  failed  components  are  replaced 
by  spares  (if  available)  and  once  repaired,  in  turn  become  spares.  A multiyear  planning  horizon 
is  considered,  allowing  for  growth  both  in  component  population  size  and  component  reliability. 

When  a component  fails,  a request  for  a spare  is  immediately  placed.  If  no  spare  is  available, 
a delay  occurs.  A service  level  constraint  of  10%  is  imposed  on  such  delays;  that  is,  we  desire  a 
capability  such  that  at  least  00%  of  the  requests  for  spares  are  immediately  filled  from  on-hand 
spares  inventory  (at  most,  10%  backordering  of  spares  requests).  This  service  criterion  is  often 
refeired  to  as  availability  or  fill  rate. 

*Tlii8  work  was  dons  under  the  Office  of  Naval  Reaearoh  Contract  N00014-7S-C-0739,  Project  NR  347  030. 
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The  objective  then  becomes  one  of  minimising  expenditures  for  spares  and  servers  subject  to 
a 90%  fill-rate  constraint.  The  classical  machine  repair  queueing  model  with  spares  is  the  driving 
model  in  that  it  determines  fill  rate  given  a specific  number  of  spares  and  servers,  the  component 
population  size,  failure  rate,  and  service  rate.  Times  between  failures  are  assumed  to  be  exponentially 
distributed  random  variables,  as  are  service  tim*es  (which  include  the  time  required  for  removal, 
transportation,  and  repair).  This  queueing  model  is  then  embedded  in  a heuristic  cost  optimization 
model  which  determines  a “good”  mix  of  spares  and  servers  for  each  year  in  the  planning  horizon 
while  satisfying  the  fill-rate  constraints. 

An  alternate  queueing  model  is  also  presented  which  treats  removal,  transportation,  and  repair 
as  separate  serving  stations,  but  which  must  assume  an  “infinite”  calling  population.  The  accuracy 
of  this  assumption  is  also  investigated. 

The  methodology  presented  here  has  been  used  for  provisioning  servers  and  spares  for  a fleet 
of  marine  gas  turbine  engine  ships  (each  engine  having  two  components — a gas  generator  and  a 
power  tiurbine).  It  is,  of  course,  applicable  to  other  similar  provisioning  problems. 

2.  QUEUEING  MODEL 

In  order  to  determine  the  probability  of  a request  for  a spare  being  met  without  delay,  it  is 
necessary  to  calculate  the  probabUity  of  various  numbers  of  components  in  the  repair  system  at 
any  particular  time.  Changes  in  the  population  with  respect  to  size  and  reliabUity  take  place  at 
random  times  throughout  each  year,  so  that  unless  (or  until)  population  size  and  component 
reliability  stop  changing,  steady  state  cannot  be  approached.  It  appears  analytically  intractable 
to  calculate  transient  state  probabilities.  As  an  approximation,  we  consider  the  population  to  be 
in  steady  state  at  its  average  size  and  reliabUity  for  an  entire  year,  changing  instantaneously  to  a 
new  steady-state  position  at  new  average  values  at  the  begiiming  of  each  new  year.  In  situations 
where  faUures  are  frequent,  trantient  effects  should  die  out  quickly  and  our  steady-state  approxi- 
mations should  be  adequate.  However,  if  faUures  are  infrequent,  the  transient  effects  will  take 
longer  to  disappear  and  might  present  a problem.  Nevertheless,  for  most  provisioning  purposes, 
the  assumption  of  instantaneous  steady-etate  should  be  adequate. 

At  any  point  in  time,  the  population  is  composed  of  units  which  may  have  different  faUure 
rates,  since  units  added  to  the  population  in  later  years  generally  are  more  reliable  due  to  tech- 
nological learning.  It  is  assumed  here  that  for  each  year,  all  components  have  identical  faUure 
rates  equal  to  the  population  average,  which  changes  on  a yearly  basis  as  described  above.  This 
assumption  that  all  components  operate  at  the  population  average  failure  rate  was  investigated 
in  Ref.  [5],  and  the  results  will  be  briefly  described  later  in  this  paper. 

We  introduce  the  following  notation: 

p,,<=steady-state  Pr  {n  components  in  repair,  year  i] 
e,=number  of  repair  facilities,  year  i 
yi=number  of  spares,  year  i 
X,= component  faUure  rate  (Poisson  mean),  year  i 
X(= population  average  faUure  rate,  year  i 
R(=expected  number  of  components  repaired,  year  i 
AT, = component  population  size,  year  i 

1/m<= average  service  (removal,  transportation,  and  repair)  time,  (exponential  mean), 
year  t. 
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The  equations  for  the  steady-state  probability  of  n items  “down”  (i.e.,  in  repair)  are  given  as 


^ W 

/XA*_ 

(iV«— n+!/«)!c7‘‘‘c 


(0<n<c<) 


(c<<n<yi)  <c<<y«) 


i>o.<  (y«<»<y<+A^«) 


(0<n<y,) 


(iV<— n+y,)!c?”‘<c 


{(iV| — Ar(_i)X<+5|_iXi_i-|-(A/^<-i — 
{5<_iX<_i-|-(Arj_i — B|_i)Xi_i}/A/"«-i, 

5, =X,  j^iV,  (n-y<)p«.  ij  • 


N,>N,.r 


Equation  Set  (1)  gives  the  standard  fomulas  for  a machine  repair  model  with  spares  (see  Gross 
and  Harris  [4],  p.  123,  for  example). 

Using  the  average  failure  rate  X for  each  year  allows  for  the  incorporation  of  changing  com- 
ponent reliabUity  as  the  years  progress.  Equation  Set  (2)  shows  how  X is  computed.  It  is,  perhaps, 
easiest  to  explain  in  the  context  of  the  marine  gas  turbine  engine  example  mentioned  previously. 
Year  one  begins  with  the  first  introduction  of  a few  gas  turbine  ships,  and  each  succeeding  year 
brings  into  the  fleet  additional  ships,  until  the  fleet  reaches  full  strength.  The  components  intro- 
duced in  the  later  years  generally  have  improved  reliability,  either  through  learning  or  a conscious 
component  improvement  program  (CIP).  Thus,  X,  tends  to  be  smaller  than  X,-!.  When  the  popula- 
tion size  is  increasing  (iVj>iV<_i),  the  average  failure  rate  over  all  components  in  the  population 
for  year  i,  X*,  is  given  as  the  weighted  average  of  the  new  components  introduced  into  the  population 
at  the  best  current  achievable  failure  rate  X<,  those  repaired  during  the  past  year  at  that  year’s 
best  achievable  failure  rate  Xt-i,  and  those  old  components  in  the  population  which  were  not 
repaired  and  are  thus  operating  at  the  old  average  failure  rate  X|_i.  If  the  fleet  size  is  decreasing 
(N,<.Ar,_i),  (i.e.,  some  ships  may  be  retired  from  service),  then  the  computation  is  given  by  the 
second  equation  of  (2),  namely,  the  weighted  average  of  those  repaired  last  year  coming  in  with  a 
failure  rate  of  X|_i  and  those  not  repaired  but  operating  at  the  old  average  X«_i.  The  average  num- 
ber repaired  in  a year  is  given  by  (3). 

The  assumption  that  all  components  operate  at  an  average  failure  rate  was  investigated  in 
Ref.  [6]  by  developing  the  exact  model  for  unequal  failure  rates  with  N=l,  Y=l,  and  c=l,  so 
that  one  component  has  a failure  rate  of  Xi  while  the  other  ' as  a failure  rate  of  Xi.  It  turns  out 
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that  (1),  aasuming  both  units  operate  at  X=(Xi+X()/2,  pves  conservatiTe  results  with  respect  to 
availability;  i.e.,  the  actual  availability  is  greater  than  that  computed  using  X,  and  (2)  even  for 
sizable  differences  in  Xi  and  Xi,  the  approximate  availability  using  T b close  to  the  actual.  The 
reader  is  referred  to  Ref.  [5]  for  more  detail.  Result  (1)  has  some  intuitive  appeal,  since  components 
having  higher  failure  rates  would  be  expected  to  fail  (and  hence  be  in  repair)  more  often.  Thus, 
the  more  reliable  components  operate  a larger  proportion  of  the  time,  so  that  the  actual  failure 
rate  for  the  system  would  tend  to  be  lower  than  the  arithmetic  average  of  the  failure  rates  over 
all  components.  Nevertheless,  it  appears  from  Result  (2)  that  this  effect  is  not  highly  significant 
and  that  using  the  population  average  failure  rate  for  each  component  is  an  adequate  approximation 
in  most  cases.  For  example,  using  the  exact  model  with  N=T =e—  1,  it  was  found  that,  even  for  Xt 
twice  that  of  X|,  the  percentage  error  in  availability  was  no  more  than  5%  when  using  T as  the 
failure  rate  for  each  component.  In  actual  applications,  the  individual  component  failure  rates 
tend  to  be  much  closer  together  than  a factor  of  two,  since  reliability  growth  is  gradual,  and  hence 
we  feel  confident  that  the  average  failure  rate  assumption  is  justifiable. 

Once  the  < are  obtained,  the  fill-rate  constraint  is  given  (temporarily  dropping  the  sub- 
script t)  by 

^1>.>0.90, 

since  when  n components  are  down,  there  are  y—n  spares  available  (n< y).  Thus,  the  probability 
of  no  spares  on  hand,  Fmt,  is 

and  the  percentage  of  requests  filled  immediately  from  on-  hand  spares  is 

^ (1 

However,  the  probability  that  tk  Jailed  component  jindt  n in  the  eyetem  should  be  used  in  the 
fill-rate  computation.  In  conunon  queueing  terminology,  these  are  the  "arriving  customer"  prob- 
abilities and,  in  the  context  of  this  paper,  correspond  to  the  occurrence  of  component  failures  \ 

which  generate  requests  for  spares.  Thus,  they  shall  be  referred  to  as  failure  point  probabilities. 

For  the  finite  source  with  spares  queue  considered  in  this  paper,  the  failure  point  probabilities  are 
not  equal  to  the  general  time  probabilities  given  by  (1).  For  the  finite  source-no  spares  queue 
(see,  e.g.  Cooper  [1],  pp.  82  ff.),  the  failure  point  probabilities  are  equivalent  to  the  general  time 
probabilities  for  a finite  calling  population  of  one  less,  but  this  relationship  does  not  hold  for 
the  spares  case. 

The  failure  point  probabilities  for  the  finite  source  with  spares  queue  (denoted  by  q„  as 
contrasted  to  the  general  time  probabilities,  denoted  by  p»)  may  be  derived  as  follows.  Using 
Bayes’  theorem, 

2aaiP]‘{n  in  syBtem|failure  about  to  occur} 

Pr{n  in  system)  Pr{failure  about  to  occur|n  in  astern} 

~T*.  fPrfn  in  system!  ftlfailure  about  to  occurln  in  system)] 
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Dividing  numerator  and  denominator  by  At  and  taking  the  limit  yields 


[ NPn  , 

t+N 

N-2  (.n—y)pn 

*-» 

(N—n+y)pn  . 

t+tf 

N-S  (n-y)p. 


(0<n<y), 


{y<n<y+N), 


where  the  p,  are  given  in  Equation  (1).  Thus,  the  fill-rate  constraints  must  be  based  on  the  that 
is 

S 2->0.90. 

n-0 

For  the  R calculation,  the  general  time  probability  is  required  so  that  (1)  is  used  as  given, 
and,  in  fact,  R equals  the  denominator  term  of  g.  multiplied  by  X. 


3.  COST  MODEL 

The  cost  model  considers  four  types  of  costs:  (a)  purchase  cost  of  spares,  (b)  purchase  cost 
of  service  channels,  (c)  repair  costs,  and  (d)  investment  costs  in  component  improvement  (relia- 
bility improvement)  programs.  Annual  operating  costs  associated  with  running  the  spares  inventory 
system  or  operating  the  service  channels  are  not  included.  While  the  variable  portion  of  these  costs 
can  be  important,  they  are  not  considered  in  this  model,  since  the  major  purpose  of  such  a strategic 
planning  model  as  this  is  for  capital  budgeting  and,  hence,  it  is  the  purchase  expenditures  which 
are  of  prime  concern  at  this  stage  of  the  decision-making  process. 

The  optimization  problem  is  an  integer-nonlinear-programming  problem  with  a hard-to- 
manage  objective  function  and  highly  complex  constraints.  An  integer  programming  algorithm 
approach  has  not  as  yet  been  successful  in  solving  the  problem,  hence,  a heuristic  method  is  utilized. 

The  heuristic  cost  “optimization”  algorithm  considers  explicitly  only  the  first  two  categories 
of  costs,  although  the  repair  and  component  improvement  program  (CIP)  costs  ore  always  cal- 
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culated  for  each  year  so  that  sensitivity  analyses  with  respect  to  various  CIP  programs,  service 
times,  etc.,  can  be  performed.  It  should  be  noted  that  even  with  no  CIP  program,  X and  N would 
still  change,  but  the  CIP  program  influences  the  magnitudes  of  these  changes.  The  present  worth 
of  the  cost  stream  over  the  planning  horizon  is  always  calculated  for  just  such  a use.  This  will  be 
illustrated  in  the  next  section. 

The  cost  minimization  problem  can  be  stated  as  follows.  Considering  the  additional  notation 
Cl.  (=  purchase  cost  of  a repair  channel,  year  i 
Cj,i= purchase  cost  of  a spare  component,  year  i 
Cj.<=cost  of  repairing  a component,  year  i 
Ci,i=investment  in  reliability  growth  (CIP)  program,  year  i 
d= discount  factor 

Ar=number  of  years  in  planning  horizon, 
we  desire  to 

(4)  Min  Z=-hd*[Ci.,{c,-c,-,)++C,,,{yi-y,.d^+Ct.tR,+Ci.,] 

Cl,  et ci;  yi,  pt, . ...pi 

(5)  Subject  to:  ^ 2n.<>0.90,  (i=l,  2 . . .,k) 


Ci>0;  integer. 
yi>0;  integer. 

The  plus  superscript  on  the  first  two  cost  terms  indicates  that  the  term  is  zero  if  the  factor  in 
parentheses  is  negative ; that  is, 

(a— 6)‘''=max{(o— 6),  0}. 

The  last  term  in  Equation  (4),  Ct,  t,  is  the  cost  of  the  CIP  and  does  not  explicitly  depend  on 
Ci  or  yt,  although  indirectly  one  can  argue  that  if  Ct.t  is  increased,  X,  will  be  smaller,  thus  affecting 
the  p^i  calculation  and  hence  the  constraint  (5).  The  heuristic  algorithm  ignores  this.  This  CIP 
effect  is  observed  via  a sensitivity  type  of  analysis  referred  to  above. 

The  third  term,  which  involves  Hi,  is  a function  of  y,  (explicitly,  see  Equation  (3)),  and  yi 
and  Cl  implicitly  through  p,, ,.  However,  when  compared  to  purchase  costs  of  spares  and  servers, 
the  annual  repair  costs  should  be  small  and  are  ignored  by  the  heuristic  algorithm.  It  is  included 
in  the  cost  calculation  because  it  is  directly  influenced  by  reliability  growth  and  is  needed  for  any 
sensitivity  study  of  CIP.  Thus,  the  heuristic  algorithm  looks  only  at  the  costs  of  purchasing  spares 
and  servers  and,  furthermore,  considers  only  one  year  at  a time;  that  is,  it  attempts  to 

Min  Zi=Ci,i(ei — Ci_i)‘*‘+Cin(yi — 1^4-1)''' 

t4.  Pi 

for  each  t=l,  2,  . . .,  it  in  a sequential  manner,  using  the  “best”  combination  it  finds  for  year 
t-1  (C(-t,  y<-i)  as  the  starting  point  for  its  search  for  (c<,  y<). 

For  the  year  i computations,  the  algorithm  first  checks  to  see  if  Ci-\,  yi-i  satisfy  the  con- 
straint. If  the  constraint  is  not  met,  the  algorithm  computes  the  ratio  A=Ct.i/Ci,<  and  takes 
the  largest  integer  value  contained  therein.  (In  the  applications  we  have  studied,  (7».i  was  always 
larger  than  C,,i.  If  this  is  not  the  case,  the  algorithm  should  be  modified  to  compute  Ci.JCt,i, 
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and  the  words  “server”  and  “spare”  interchanged  in  the  description  that  follows.)  Thus,  for 
an  (approximately)  equal  dollar  expenditure,  we  can  purchase  one  spare  or  A servers.  Both 
points  (Ci-i+A,  2/<_i)  and  (c<_:,  y(_i+l)  are  checked  to  determine  the  fill-rate  percentage  via 
use  of  the  queueing  model,  and  the  point  with  the  largest  fill-rate  percentage  is  chosen.  This 
procedure  is  continued  from  the  “new  points”  chosen  until  the  fill  rate  reaches  (or  exceeds)  90%. 
If  90%  is  exceeded  (and  it  generally  will  be  because  of  the  integer  values  required  for  c and  y), 
a reduction  procedure  is  employed  to  see  if  any  servers  can  be  decreased  (since  A servers  at  a time 
have  been  added)  and  still  maintain  a 90%  fill  rate.  Also,  if  the  feasible  region  has  been  entered 
by  adding  A servers  (as  opposed  to  adding  a spare)  in  addition  to  decreasing  servers,  an  attempt 
is  made  to  reduce  the  number  of  spares.  This  reduction  portion  of  the  algorithm  is  intended  to 
find  a solution  as  near  to  the  constraint  boundary  as  possible.  The  procedure  is  schematically 
shown  in  Figure  1,  which  displays  all  the  points  (c,  y)  evaluated  by  the  algorithm  along  with  the 
path  of  greatest  improvement  in  fill  rate.  Note  that  for  each  point  on  the  path,  fill  rate  is  calculated 
for  the  points  immediately  above  and  to  the  right,  and  then  a move  is  made  to  the  one  with  the 
largest  fill  rate. 

Should  the  starting  value  for  year  i(cj_i,  y<_i)  be  such  that  the  90%  fill  rate  criterion  is  ex- 
ceeded (this  may  happen  if  population  size  is  decreasing,  i.e.,  units  are  being  retired  from  service), 
then  the  algorithm  immediately  goes  into  the  reduction  mode,  first  trying  to  reduce,  to  the  greatest 
extent  possible,  spares  (assuming  that  they  are  more  expensive  than  servers  and  hence  should  have 
a larger  salvage  value)  and  then  attempting  to  reduce  servers  to  get  as  close  to  the  90%  boundary 
line  as  possible.  Again,  if  servers  are  more  expensive  than  spares,  the  words  “server”  and  “spare” 
should  be  interchanged. 
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FIGURE  1.  Heuristic  algorithm. 
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4.  SAMPLE  RESULTS 


Using  the  marine  gas  turbine  fleet  as  an  example,  Table  1 shows  the  results  for  an  11-year 
planning  horizon,  from  1975  to  1985,  using  an  interest  rate  of  10%.  The  fleet  size  starts  out  at  a 
relatively  small  number,  building  up  to  full  strength  by  year  1985.  The  component  exhibited  is 
the  gas  generator  component.  Similar  calculations  were  performed  for  the  power  turbine  component. 
Generators  and  turbines  are  completely  independent  as  they  require  different  repair  facilities,  and, 
of  course,  spares  are  not  interchangeable. 

The  first  nine  columns  of  Table  1 (except  the  X column,  column  4)  are  input.  Column  1 gives 
the  year;  Colunm  2,  the  anticipated  population  size;  Column  3,  the  mean  failure  rate  schedule  for 
the  particular  CIF  chosen  (in  failures  per  day) ; Column  5,  the  average  service  (including  removal, 
transportation,  and  repair)  time  (in  days) ; and  Columns  6-9,  the  purchase  cost  of  servers,  purchase 
cost  of  spares,  unit  repair  cost,  and  investment  in  CIP  cost,  respectively,  with  C\  through  Ci  being 
in  thousands  of  dollars  per  unit,  while  f?!  is  in  thousands  of  dollars  per  year. 

Column  4 and  the  last  five  columns  are  output.  It  was  convenient  to  show  X|  (Column  4) 
next  to  X,  so  that  one  can  readily  see  the  reliability  growth.  Columns  10  and  11  give  the  algorithm’s 
"best”  Ct,  yi.  Column  12  shows  average  number  of  units  repaired  for  each  year,  while  Column  13 
provides  the  expenditures  (in  thousands  of  dollars)  for  year  t.  Column  14  gives  the  present  worth 
of  the  discoimted  cost  stream  up  to  and  including  year  i.  The  final  value  in  Column  14  is  the 
present  worth  of  all  expenditures  over  the  complete  planning  horizon,  again  in  thousands  of  dollars. 

Another  run  is  presented  in  Table  2 which  differs  only  in  the  CIP  chosen.  This  illustrates 
how  one  can  study  the  consequence  of  various  alternative  CIP's.  There  we  assume  that  in  order 
to  save  CIP  investment  cost  (Ct),  we  will  stop  the  program  at  year  1978,  thus  achieving  only  a 
minimum  component  failure  rate  of  0.001  failures  per  day.  By  so  doing,  we  save  the  two  large 
expenditures  in  1979  and  1980,  and  immediately  drop  to  the  $350,000  per  year  needed  to  maintain 
the  achieved  reliability  of  0.001  failures  per  day.  However,  in  comparing  the  present  worth  of  the 
cost  streams  for  the  two  cases,  we  see  that  it  actually  costs  about  $5  million  more  to  do  this,  since 
we  require  more  servers  and  spares.  Thus,  in  this  case  the  added  CIP  investment  is  well  worth 
the  expenditure. 

6.  PROGRAM  PERTURBATION  CAPABIUTY 

The  heuristic  algorithm  operates  on  a year-by-year  basis,  and  hence  has  the  limitation  that 
it  does  not  "look  ahead.”  While  we  believe  the  heuristic  algorithm  does  a good  job  in  giving  an 
optimal  or  near  optimal  point  (c,  y)  for  any  year,  when  considering  the  entire  planning  horizon,  it 
may  not  prove  as  effective.  For  example,  consider  Table  1 once  again,  particularly  year  1980.  We 
see  that,  for  that  year,  a jump  from  12  servers  in  1979  to  15  in  1980  is  required,  but  in  the  following 
year  only  13  servers  are  needed.  Recall  that  no  explicit  salvage  value  is  received  for  decreasing 
servers.  This  in  itself  is  not  too  serious,  except  we  note  that  for  1984  we  must  increase  servers 
back  up  to  15,  having  to  purchase  two  additional  servers  for  that  year.  Had  we  kept  the  15  from 
1981,  we  would  not  have  needed  to  make  further  server  purchases. 

But  more  importantly,  perhaps  we  could  have  avoided  going  to  15  servers  in  1980  had  we 
purchased  an  additional  spare,  especially  since  it  was  needed  for  year  1981  anyway.  The  algorithm 
allows  us  to  go  back  to  1980  and  use  as  a starting  point  e=12,  ]/=13  (instead  of  the  1979  solution 
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e=12,  y=  1 1) . This  may  give  a different  e,  y schedule  from  that  year  on,  which  might  have  a cheaper 
fixed  present  worth.  This  capability  to  go  back  to  any  year  in  the  schedule  and  use  a starting 
point  other  than  the  previous  year’s  c,  y we  refer  to  as  the  perturbation  capability.  Using  a different 
starting  point  (if  chosen  “intelligently”)  might  get  us  to  a different  feasible  solution  point  which 
avoids  temporary  increases  in  either  c or  y.  The  program  then  continues  its  calculation  from  the 
year  for  which  the  perturbation  is  desired,  adding  the  new  cost  stream  with  the  correct  discount 
factor  from  that  year  forward,  giving  us  the  present  worth  for  the  new  schedule  which  may  result. 
Table  3 shows  this  effect  for  perturbing  the  Table  1 solution  at  year  1980,  using  the  starting  point 
e=12,  v=13  rather  than  the  1979  solution  of  c=12,  y=\\.  We  see  for  the  new  resulting  schedule 
(which  differs  only  for  year  1980)  that  the  total  present  worth  is  now  about  $39.05  million,  a savings 
of  approximately  $0.1  million.  Further  possible  perturbations  are  also  shown  with  the  resulting  costs. 
No  other  obvious  perturbations  are  indicated,  although  earlier  purchasing  to  avoid  inflation  might 
be  evaluated. 

At  first  glance,  it  appears  this  multiyear  programming  problem  might  be  amenable  to  a dynamic 
programming  solution  where  the  years  are  stages  and  c and  y are  the  state  variables.  Aside  from  the 
fact  that  there  are  two  state  variables  (which  makes  the  computations  formidable),  the  problem  is 
not  decomposable.  The  fill-rate  constraint  must  hold  for  each  year.  This  constraint  (e.g.,  for  year  i) 
involves  the  g,,,  which  are  functions  of  the  pn.i  which  in  turn  are  functions  not  only  of  the  decision 
variables  for  year  i (ci,  ^,) , but  also  of  all  previous  (;’s  and  y’s,  since  X,  is  a function  of  X|-i  which  is  a 


TABLE  3.  Perturbed  Solutions,  das  Generator — FuU  CIP 
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Year 

Original  Solution 
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(1982) 

(1983) 

c y 

c y 

c y 

c y 

1976 

1976 

1977 

1978 

1979 


3 3 

5 6 

8 8 

10  10 

12  11 


3 3 

5 6 

8 8 

10  10 

12  11 


3 3 

5 6 

8 8 

10  10 

12  11 


3 3 

5 6 

8 8 

10  10 

12  11 


1980 


16  12 


12  13 


12  13 


1981 

1982 

1983 

1984 

1985 


13  13 

14  13 

13  14 

16  14 

16  14 

$39, 146, 380 


13 

13 

14 

13 

13 

14 

15 

14 

16 

14 

$39, 061, 460 


13 

13 

13 

14  1 

13 

14 

15 

14 

16 

14 

$39,  046,  080 


13 

13 

14 

13 

14 

14 

16 

14 

16 

14 

$38,  995, 440 


Present  Worth 


532 


D.  GROSS,  H.  D.  KAHN  AND  J.  D.  ICARSH 


function  of  Xi_j,  etc.  (see  (2)).  This  prevents  starting  at  the  last  stage  and  proceeding  backward. 
Only  if  the  reliability  were  constant  throughout  the  multiyear  planning  horizon  could  dynamic 
programming  be  attempted.  However,  we  believe  that  the  heuristic  algorithm,  coupled  with  the 
perturbation  capability,  does  provide  good,  although  not  necessarily  optimal,  schedules. 

S.  SERIES  QUEUEING  MODEL 

The  queueing  model  portion  of  the  program  given  by  liquation  (1)  assumes  that  removal, 
transportation,  and  repair  are  a single  service  action,  so  that  if  all  servers  are  busy  and  a component 
fails,  it  must  wait  in  a queue  prior  to  removal  for  an  available  server.  Once  removal  is  initiated, 
no  further  queues  for  transportation  or  repair  are  encountered. 

In  many  situations,  the  removal,  transportation,  and  repair  phases  may  be  separate,  each 
having  their  own  “servers”  with  their  own  queues.  To  model  this  situation  would  require  the  pn,\ 
and  (f,,  I to  be  determined  by  a closed  network  or  cyclic  queueing  model  which  can  account  for  the 
finiteness  of  the  population,  but  this  requires  quite  involved  calculations  (see,  for  example,  refs. 
(3],  [10],  and  [11]).  The  question  then  naturally  arises  as  to  how  crucial  the  finite  source  assump- 
tion might  be,  since  rather  simple  queueing  formulas  exist  for  infinite  source  series  queues  with 
exponential  arrival  and  service  patterns.  For  example,  suppose  for  component  removal,  Ci  removal 
teams  are  available  for  the  system  (ci  could  equal  the  population  size  N,  in  which  case  we  have  an 
ample  server  model) . Assuming  that  removal  times  are  exponential  and  failures  are  Poisson,  we  can 
model  the  removal  portion  by  an  M/M/ci  queue.  The  output  of  such  a queue  (assuming  an  in- 
finite population)  is  Poisson.  Further,  suppose  there  were  ct  transport  vehicles  available  to  the  sys- 
tem for  shipping  components  to  the  repair  depot.  The  transportation  phase  would  then  be  an 
MjMjct  queue  if  we  assume  that  transport  times  are  exponential.  P^ally,  if  we  assume  that  there 
are  c*  repair  channels  (again  with  an  infinite  population),  the  repair  portion  becomes  an  MIM/ct 
queue. 

If  we  let  pi^i  be  the  probability  of  n components  at  thejth  phase  (j=l,  2,  3)  for  year  i,  it  can 
be  shown  (see  Gross  and  Harris  [4],  p.  203)  that  the  joint  probability  of  f in  the  removal  portion,  m 
in  the  transportation  portion,  and  n in  repair  for  year  i (call  pi.*,  «)  is  given  by 

Pi.*.*i=Pi1ip.?*p"«- 

If  the  year  subscript  i is  dropped  for  the  time  being,  the  fill-rate  constraint  for  each  year  corre- 
sponding to  Equation  (5)  becomes 

m • 

where  the  summation  is  taken  over  all  combinations  of  l,m,n  such  that  (f+m-+-n<y— 1).  Here 
the  equal  the  p\P , as  we  are  assuming  an  infinite  source  model.  Since  is  readily  computable 
for  MjMjc  models,  a series  queue  representation  is  possible  as  long  as  wo  can  justify  approximating 
the  finite  source  situation  by  an  infinite  source  model.  If  we  assume  for  the  moment  that  we  can 
(this  will  be  discussed  more  fully  below),  then  the  previous  methodology  can  bo  used  if  we  sub- 
stitute for  the  previously  used  g.,  < and  p,.  < the  sum  of  probabilities 


QUEUSmO  MODELS  FOR  SPARES  PROVISIONINO 


533 


with  the  arrival  rate  for  this  mfmite  source  series  queueing  model  adjusted  to  be 
(7) 


Once  again,  if  we  drop  the  year  subscript  i for  convenience,  the  standard  MjMjc  formulas 
(see  Gross  and  Harris  [4],  p.  96,  for  example)  give 


(8) 


(») 


!>*"  = 


X* 

X* 


(\<k<e,) 


with  X being  given  by  (7),  Cf  being  the  number  of  servers  for  phases,  and  n,  being  the  mean  service 
rate  for  phase  j (j=l=removal,  j= 2= transportation,  j=3=repair).  In  some  situations,  there 
may  even  be  a fourth  phase  consisting  of  transportation  from  repair  depot  to  spares  pool. 

The  mathematical  programming  problem  which  we  attacked  via  a heuristic  algorithm  is  now 
considerably  more  complex  if  ci  and  Ca  are  also  decision  variables,  since  we  now  seek,  for  each  year, 
the  best  combination  (ci,  Cai  Ct,  y)  and  not  merely  (c,  y).  We  no  longer  have  a single  A,  but  three 
A’s,  say  Ai,  A|,  and  At.  For  equal  dollar  expenditures  then,  we  can  buy  one  spare  or  Ai  removal 
teams,  or  At  transport  vehicles,  or  A|  repairmen.  Further,  there  are  combinations  such  as  aAi 
removal  teams+|9Aa  transport  vehicles+yAi  repairmen  (0<a,  0,  7<1)  which  might  equal  the 
purchase  cost  of  a spare.  Conceptually,  the  extension  to  our  heuristic  algorithm  would  be,  given  a 
particular  «■,  Ct,  Ci,  and  y,  to  calculate  the  increase  in  All  rate  when  moving  to  all  points  of  equal 
expenditure;  that  is,  to  calculate  the  fill  rate  for  (ci,  ct,  Ct,  y+1),  (ci+Ai,  Cj,  «•,  y),  (ct,  Ci+A|,  ct,  y), 
(cii  «i.  ^+At,  y),  and  (ci+oAi,  ei+/3Ai,  Ci+^A*,  y)  for  all  appropriate  a,  0, 7.  For  simplification,  the 
equal  expenditure  points  involving  combinations  of  Ci,  ct,  e%  (those  points  with  the  a,  0,  and  7 
terms)  might  be  ignored  so  that  at  each  step  we  add  either  one  spare,  or  Ai  removal  teams,  or  At 
transports,  or  A*  repairmen. 

If,  on  the  other  hand,  Ci  and  Ct  are  not  decision  variables  (and  in  our  applications  they  were 
not),  then  the  algorithm  need  not  be  modified  since  we  seek  only  “optimal  points”  (ct,  y).  Thus, 
once  the  p.'s  and  ja’s  are  replaced  by  their  series  model  counterparts  as  given  in  Equations  (6), 
(8),  and  (9),  the  algorithm  is  exercised  as  before,  with  Ci  and  Ci  being  fixed  values.  In  the  next 
section,  we  present  some  results  for  such  a case  after  first  considering  the  question  of  accuracy 
when  using  an  infinite  source  approximation  to  a finite  population. 


7.  ACCURACY  OF  INnNITE  SOURCE  APPROXIMATIONS 

In  order  to  use  a series  queueing  model,  the  effect  of  assuming  an  infinite  population  for  calcu- 
lating state  probabilities  when  in  actuality  the  population  is  finite  must  be  investigated.  Let  us 
consider  now  the  nonseries  finite  source  repairman  with  spares  model  given  by  Equation  (1)  and 
its  infinite  population  counterpart,  an  M/Af/c  model  with  an  arrival  rate  adjusted  as  in  Equation 
(7),  that  is,  X«A/^.  In  the  M/MIe  model,  X is  always  N\,  regardless  of  the  number  of  units  down. 
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In  the  finite  source  model,  X is  state  dependent  and,  in  fact,  is  given  in  Section  2 for  the  g,  develop- 
ment as 


A^,  (0<n<y) 


X.= 


(,N—n+y)\, 


(y<n<y+N) 


0,  (re>y+A0, 


where  n is  the  number  of  unite  “down,”  that  is,  in  “repair.”  Now  the  overall  average  arrival  rate 
of  the  finite  source  model  would  be 


= « - »+«  _ 

X=S  X,p,=S  N\p^+  22  (iV— n+y)Xp, 

• -0  «-0  »"»+l 

t+N  _ i+N  - 

= s N\p,-  22  (n-y)\p, 

n-O  n^i+l 

_ _ f+N  _r  f+N  “I 

=iVX-X^^^  (n-y)p,=X  j^iV— (n  — y)p»J' 


which  incidentally  is  how  R of  (3)  is  derived.  Thus,  if 

f+N 

Sin-y)pn 
1 

is  small  in  relation  to  N,  the  infinite  source  approximation  should  be  adequate,  since  \=N\,  the 
quantity  assumed  by  the  infinite  source  model.  This  should  be  the  case  for  systems  with  not  too 
much  congestion  (p,  small  for  n^y),  y and  N large.  The  fact  that  a fill  rate  of  90%  must  be  guar- 
anteed should  require  that  c and  y be  large  enough  such  that  there  is  a relatively  small  amoimt 
of  congestion  in  the  system,  thus  making  the  approximation  good  even  for  small  N. 

Table  4,  first  and  third  blocks,  show  comparisons  for  the  two  cases  run  in  Tables  1 and  2, 
respectively,  with  an  infinite  source  Af/M/c  used  in  calculating  p,„.  The  results  differ  only  slightly, 
with  the  infinite  source  model  requiring  an  additional  server  or  two  in  a few  of  the  years.  The  in- 
finite source  model  will  occasionally  require  slightly  higher  c (or  y)  than  the  finite  source  model  in 
order  to  meet  the  availability  constraint.  This  is  caused  by  the  lack  of  state  dependence  in  the 
arrival  rate  for  the  infinite  source  model,  which  has  the  effect  of  making  the  infinite  source  arrival 
rate  slightly  higher  than  the  arrival  rate  for  the  corresponding  finite  somce  model.  Hence,  the 
infinite  source  model  is  a conservative  approximation  with  respect  to  fill  rate. 

Since  the  infinite  source  approximation  appears  to  give  good  results,  we  have  run  an  infinite 
source  three-stage  (removal,  transport,  and  repair)  series  model,  using  the  same  data  as  in  Tables 
1 and  2,  but  assuming  that  removal  is  M/Mjoo,  transport  is  M/M/oo,  and  repair  is  MIM/e  (c  to  be 
determined  as  before) . 

Actually,  for  ample  server  queues,  it  is  not  necessarj''  to  assume  exponential  service  since  the 
output  of  MIQjoo  is  also  Poisson  (see,  for  example,  Mirasol  [7]).  Thus,  these  results  also  hold  for 
M/Q/oo  removal  and  transport  phases.  The  mean  single-stage  repair  time  is  allocated  to  the  multi- 
stage model  so  that,  on  the  average,  5%  is  consumed  in  removal,  20%  in  transportation,  and  76% 
in  repair.  These  results  comparing  the  two  models  are  also  shown  in  Table  4.  Here,  in  many  of  the 
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TABLE  4.  Comparison  of  Finite  Source,  Infinite  Source,  and  Series  Models 


Case  1 — Full  Component 
Improvement  Program 

Case  2 — Partial  Component 
Improvement  Program 
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years,  fewer  servers  are  required  to  achieve  the  90%  fill  rate  when  treating  removal,  transporta- 
tion, and  repair  separately.  This  is  as  we  would  expect,  since,  if  these  three  phases  are  separate, 
removal  and  transportation  can  occur  even  if  there  is  a queue  at  the  repair  facility.  By  treating  the 
separate  phases  together  as  one  service  operation,  as  in  the  single-stage  model,  we  are  in  essence 
requiring  a free  repair  facility  prior  to  removal,  in  which  case  the  repairman  would  be  idle  until 
the  component  arrived.  Thus,  in  the  situation  where  removal  and  transportation  have  ample  serv- 
ers and  queueing  takes  place  only  at  repair,  conservative  results  are  obtained  if  we  treat  the  three 
separate  phases  as  one.  If  indeed  the  service  portion  is  made  up  of  separate  phases,  a more  realis- 
tic representation  of  the  system  is  obtained  by  using  the  series  model,  in  spite  of  the  slight  inac- 
curacy caused  by  invoking  the  infinite  source  assumption  when  calculating  probabilities.  With 
fill-rate  constraints  of  90%  or  better  which  guarantee  relatively  little  congestion,  it  appears  that 
the  inaccuracies  caused  by  using  the  infinite  source  assumption  in  the  series  model  are  negligible 
as  compared  to  those  that  result  when  using  a single-stage  finite  source  model  if  the  service  actu- 
ally is  multistage. 
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ABSTRACT 

The  problem  of  multiple-resource  capacity  planning  tmder  an  infinite  time 
horison  is  analyzed  using  a nonlinear  programming  model.  The  analysis  generalizes 
to  the  long  term  the  short-nm  pricing  model  for  computer  networks  developed  in 
Kriebel  and  Mikhail  [5].  The  environment  assumes  heterogeneous  resource  capa- 
cities by  age  (vingate),  which  service  a heterogeneous  and  relatively  captive  market 
of  users  with  known  demand  functions  in  each  time  period.  Total  variable  operating 
costs  are  given  by  a continuous  psuedoconcave  function  of  system  load,  capacity, 
and  resource  age.  Optimal  investment,  pricing,  and  replacement  decision  rules 
are  derived  in  the  presence  of  economies  of  scale  and  exogenous  technological 
progress.  Myopic  properties  of  the  decision  rules  which  define  natural  (finite) 
planning  subhorizons  are  discussed. 

1.  INTRODUCTION 

During  recent  years,  a number  of  articles  have  appeared  in  the  management  science  and 
electronic  data  processing  literature  concerning  the  allocation  and  control  of  computer  system 
resources,  cf  Ref.  [5].  The  basic  issue  of  resource  allocation  and  control  in  general  is  not  new,  of 
course,  and  both  economists  and  accountants  have  studied  the  question  for  many  years,  e.g., 
[8,  10].  Notwithstanding  this  fact,  some  contemporary  authors  have  argued  (rightly  or  wrongly) 
that  computer  systems  are  different:  the  particular  characteristics  of  computer  technology,  the 
heterogeneity  of  computer  users,  cost-benefit  measurement,  and  the  diversity  of  the  literature  itself 
inhibits  (or  precludes)  the  application  of  a “conventional  theory”  to  this  environment.  For  example, 
the  pricing  of  services  provided  by  computer  system  resources  is  often  an  administrative  decision 
in  practice;  when  arguments  are  advanced  for  marginal  cost  pricing,  counterarguments  are  raised 
that  operating  costs  are  fixed  or  that  joint-cost  allocation  is  indeterminate  (or  arbitrary),  and  so  on. 

In  a previous  article  [5],  the  general  issue  of  pricing  computer  resources  was  discussed  at  some 
length  from  both  the  theoretical  perspective  of  a short-run  price-capacity  decision  model  and  the 
pragmatics  of  implementing  the  resulting  decision  rules.  In  this  sequel,  we  extend  the  nonlinear 
programming  model  to  derive  the  optimal  investment,  pricing,  and  replacement  of  computer 
resources  over  an  infinite  time  horizon  in  the  presence  of  economies  of  scale  and  exogenous  tech- 
nological progress. 
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Several  authors  have  used  this  programming  formulation  to  address  the  issue  of  optimal 
capacity  selection  and  to  infer  pricing  policies  in  an  environment  of  varying  intertemporal  demand. 
Kriebel  and  Mikhail  [5]  considered  the  acquisition,  use,  and  pricing  of  a multiple-resource  computer 
network  over  a (short-run)  fixed  planning  horizon  where  capacities,  once  selected,  remain  constant 
throughout.  The  model  proposes  a dynamic  pricing  policy  and  imputes  the  total  revenue  for  the 
output  onto  the  individual  resources. 

Littlechild  [6]  introduced  the  concept  of  individual  and  distinguishable  vintages  of  a single 
resource  being  capable  of  producing  homogeneous  output.  The  equipment,  however,  is  characterized 
by  an  increasing  operating  cost  function  as  the  vintage  becomes  older.  Although  resources  were 
implicitly  assumed  to  have  an  infinite  productive  life,  technological  improvements  in  subsequent 
vintages  imparted  a finite  economic  life  to  any  particular  vintage  whereby  the  machine  simply  fell 
into  disuse  because  of  its  relative  cost  inefficiency. 

Baumol  [2]  homogenized  all  vintages  by  assuming  constant  technology,  but  aspects  of  physical 
deterioration  of  capacity  were  incorporated  in  the  model.  While  Littlechild  assumed  assets  of  the 
“one  hoss  shay”  type,  which  provides  constant  capacity  untU  expiration,  Baumol  explicitly  allowed 
for  systematic  decline  in  capacity. 

Thus,  while  neither  author  addressed  the  sale  of  equipment  as  a vehicle  to  adjust  capacity, 
there  were  de  facto  approaches  to  model  the  expiration  of  capacity.  In  the  model  developed  below, 
we  provide  directly  for  the  sale  of  capacity  and  derive  explicitly  the  optimality  conditions  associated 
with  the  disposal  decisions.  We  also  maintain  the  concept  of  vintages  and  assume  that  operating 
cost  is  a monotonic  increasing  function  of  age.  It  should  be  noted  that  the  physical  decline  of 
capacity  can  be  easily  adopted  in  our  model,  but  the  notion  is  deemed  less  applicable  in  the  computing 
environment. 

Beyond  incorporation  of  the  resource  replacement  decisions,  our  analysis  yields  a natural 
(finite)  planning  subhorizon  for  the  general  problem,  and  thus  provides  an  explicit  analytical 
linkage  to  the  previously  decomposed  short-run  model  [5].  In  the  next  section,  we  summarize 
the  major  assumptions  and  rationalization  for  applying  the  model  to  the  intended  environment. 
The  demand  and  production  functions  are  then  described,  followed  by  a complete  specification  of 
the  infinite  horizon  model.  In  Section  4,  the  formal  results  of  the  analysis  are  given  which  yield 
the  corresponding  optimal  decision  rules.  Finally,  contributions  of  this  paper  are  summarized 
and  additional  extensions  of  the  basic  framework  are  addressed. 

2.  ENVIRONMENTAL  AND  ANALYTICAL  ASSUMPTIONS 

The  prospect  of  employing  a “market  pricing  system”  for  the  control  and  allocation  of 
computer  services  to  users  depends  upon  several  basic  assumptions.  To  begin,  we  assume  that 
users  are  rational  decision  makers  in  consuming  services,  and  that  this  rationality  is  reflected  by 
individual  demand  functions  for  computation  which  are  known  to  the  individual  and  to  the  center 
in  aggregate.  For  example,  the  user’s  demands  can  be  presumed  to  have  been  derived  on  the  basis 
of  the  solution  to  a utility  maximization  problem  for  each  individual  (or  “job  class  group”).  More- 
over, we  a.ssume  that  the  level  or  quality  of  service  provided  to  each  user  is  completely  reflected 
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in  the  price-requirements  relation  of  the  demand  functions  for  each  service;  i.e.,  service  quality  is 
constant  at  the  price  charged  to  each  user  of  the  service  in  a given  time  period.'" 

Thus,  the  issue  of  intertemporal  imcertainty  is  ruled  out  at  the  outset.  We  assume  that  users 
can  and  will  alter  their  pattern  of  use  over  time  in  response  to  changes  in  the  price  of  an  output 
service.  Furthermore,  changes  in  the  posted  prices  for  each  output  service  by  time  period  are 
assumed  to  be  administratively  feasible.** 

The  production  function  of  the  center  for  each  output  service  (job  class)  in  each  time  period 
is  presumed  known  and  depends  on  the  input  requirements  of  the  output  for  a set  of  resources  and 
the  capacity  of  the  individual  resources  available  in  the  period.  For  each  resource  class,  different 
vintages  (e.g.,  equipment  on  hand  of  different  ages)  are  assumed  to  be  homogeneous  with  respect 
to  service  capability  provided.  Resource  capacity  imits  are  assumed  to  be  continuous  and  resources 
are  perfectly  divisible  for  purchase  and  disposal  decisions.  The  structure  of  the  model  dictates  that 
all  demands  are  prccessed  within  the  period  of  arrival.  Thus,  for  example,  if  at  time  ( a user  observes 
a forward  price  in  period  t-|-  T that  is  lower  than  the  current  price,  and  he  can  delay  his  consumption, 
rationality  dictates  that  he  will.  This  fact  rules  out  potential  internal  economies  from  job-splitting 
and  ensures  one-period  turnaround  time  for  jobs.f 

In  formulating  the  problem,  the  center  seeks  to  plan  over  an  infinite  time  horizon  with  the 
objective  of  maximizing  the  net  discounted  value  to  the  firm,  which  is  the  sum  of  producer  and 
consumer  surplus.  ffWe  assume  that  total  variable  operating  costs  are  pseudoconcave  as  a continuous 
function  of  the  load  on  the  center,!  that  economies  of  scale  exist  in  operations  with  respect  to 
resource  capacities,  that  exogenous  improvements  in  computer  technology  occur  over  time,  and  that 
existing  resources  economically  deteriorate  with  age. 


3.  AN  INFINITE  HORIZON  MODEL 

The  basic  model  of  interest  is  a nonlinear  programming  characterization  which  maximizes  the 
net  discounted  value  to  the  firm  over  an  infinite  planning  horizon  subject  to  known  constraints  on 

*A8  discussed  in  Ref.  [5],  separate  consideration  of  service  quality  can  be  accommodated  within  the  model 
below;  it  is  omitted  here  for  analsrtical  simplicity.  See  also  the  explicit  treatment  of  service  quality  in  Ref.  [4]. 
For  example,  quality  might  be  reflected  in  the  availability  of  different  products  with  different  prices  and  different 
demands  as  a function  of  time,  or  by  different  opportunity  costs  to  users  incorporated  into  either  the  variable 
operating  cost  functional  of  the  model  or  into  individual  user’s  ability  and  willingness  to  pay  for  quality. 

♦♦There  is  no  question  that  uncertainty,  congestion,  and  reliability  figure  prominently  in  the  environment  of 
computer  services.  We  would  argue  that  this  occurs  primarily  on  a microlevel  time  frame,  e.g.,  job-to-job,  hourly, 
within  shift,  or  daily.  As  the  time-period  interval  is  extended,  e.g.,  to  months  or  quarters,  these  considerations 
rapidly  diminish  in  importance  through  aggregation,  and  the  certainty  assumption  becomes  less  critical.  The  more 
aggregated  view  is  our  intentional  reference,  and,  similarly,  the  administrative  consideration  for  price  changes  is 
less  of  an  issue. 

t Again,  we  emphasize  the  time  period  interval  under  consideration  (see  above  footnotes). 

ttThe  objective  of  maximizing  the  net  discounted  value  to  the  firm  is  the  conventional  assumption  of  welfare 
economics;  e.g.,  see  Ref  [4]  for  an  elaboration  in  this  context.  An  alternate  formulation  is  to  consider  the  renter  as  a 
discriminating  monopolist  (with  limitations  on  the  monopoly  power)  facing  a relatively  captive  market  of  demand; 
this  analysis  was  illustrated  in  Ref  [5]  and  the  cross  relations  are  shown  below.  However,  the  monopoly  model  will 
result  in  suboptimizations  for  the  firm.  In  practice,  most  captive  centers  consider  demand  exogenous  and  usually 
price  services  to  balance  (negotiated)  budget  costs  on  an  annual  basis;  e.g.,  see  Ref.  [9]. 

tSee  Ref.  [6],  pg.  107-108. 
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output  (job  class)  demands  in  each  time  period,  production  functions  for  system  output  in  terms  of 
resource  requirements  for  each  service,  net  capacity  restrictions  for  each  resource  class,  and  feasi- 
bility requirements  on  the  decision  variahles.  The  decision  variahles  of  the  problem  are  the  invest- 
ments in  resource  capacities  over  time,  the  disposal  (sale)  of  obsolete  (inefficient)  resources,  and  the- . 
unit  prices  to  charge  users  in  each  time  period.  The  specific  notation  of  the  model  follows. 

Consider  a computation  center  environment  comprised  of  «=1,  2,  S shared  resources 
(servicing  facilities)  and  j=l,  2,  J outputs  (job  classes).  In  each  time  period  t=l,  . . ., 
r,  . . .,  the  processing  of  users’  requests  for  service  consists  of  the  production  of  an  output  which 
imposes  requirements  on  the  shared  resources  of  the  center.  Let  represent  the  demand  of  job 
j in  time  period  f,  which  in  turn  is  composed  of  a set  of  resource  requirements  {yyn}.  * Let  pi,  represent 
the  unit  price  (rate)  for  service-charged  output  j at  time  t,  which  is  a function  of  the  quantity  of 
service  demanded;  i.e., 

(1)  Pu=Dj,(y„),  j=l, . . .,Ji  t=l,  2, . . . 

We  assume  that  the  “demand  curve”  in  (1)  is  downward-sloping  for  all  outputs — both  individually 
and  collectively  larger  quantities  of  service  will  be  demanded  at  lower  prices,  and  conversely. 
At  a given  point  in  time,  the  center  may  have  more  than  one  vintage  of  each  resource  available 
for  production;  i.e.,  equipment  of  different  age  depending  upon  the  time  period  in  which  it  was 
originally  purchased.  Let  the  subscript  v correspond  to  the  vintage  of  a resource  in  time,  for  e=l, 
2,  . . .,  t.  We  assume  in  a time  period  that  the  given  resource  requirements  of  each  output  may 
be  processed  (serviced)  by  any  one  or  several  of  the  vintages  of  that  resource  available  for  produc- 
tion in  the  period.  In  particular,  the  production  function  for  each  output  j and  time  period  t is 
given  by 

Vu—^uiUmt  • • •>  Vtft  • • •! 

where 

( 

y>i.=g 

for  «=1,  . . S and  y/it.is  the  output  requirements  for  j,  t of  resource  categoiy  a processed  on  (or 
by)  facilities  of  vintage  o.f  For  the  production  function  in  (2),  we  assume  that  the  marginal  product 

90«(a.,  K,  e,)ldb,mg^^{b,  •)  or  eimply  j,*. 

of  a resource  for  each  and  all  outputs  does  not  depend  on  vintage. t However,  the  variable 
operating  cost  of  output  is  a function  of  resource  vintage  employed,  viz,  variable  costs  increase 
by  a factor  0 per  period  as  a resource  ages,  where  0< 1.  That  is,  a resource  cannot  become  less 
expensive  to  operate  as  it  becomes  older.  All  resource  capacity  acquisitions  are  new  when  pur- 
chased; at  time  t,  only  resources  of  vintage  t may  be  purchased.  As  a convention,  we  assume  that 

*For  example,  the  yui  variables  might  correspond  to  central  processor  time,  input/output  requests,  primary 
memory  space,  etc. 

fFor  convenience  and  to  simplify  notation,  we  will  employ  the  following  conventions  in  the  remainder:  all  func- 
tions are  assumed  differentiable,  and,  as  in  (1)  and  (2),  will  be  designated  by  capital  letters;  their  derivatives  will 
be  written 

IThat  is,  for  given  s we  assume  fii’iyii.,  •)=//i*(V(i..  •)  ...,<. 
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the  physical  acquisitions  occur  just  prior  to  the  start  of  a period  so  that  the  capacity  is  available  for 
production  in  the  period  of  purchase;  symmetrically,  a sale  of  capacity  in  a time  period  precludes 
its  availability  for  production  in  the  period  of  sale. 

For  each  time  period  (=1,  2,  . . let 

^=the  discount  or  interest  rate; 

ZM=the  capacity  of  resource  « purchased  in  period  t (of  vintage  t only) ; 
9i„=the  amount  of  capacity  of  resource  class  « and  vintage  v sold  in 
time  period  t; 

I 

S 2TM=tho  net  capacity  of  output  service  produced  by  re- 

r»» 

source  class  « of  vintage  v in  time  period  (; 

j 

yitt,=tho  total  output  on  vintage  “equipment”  v of  resource 
class  « in  time  period  t; 

^M(za)=the  investment  costs  of  resource  capacity  for  resource  « at  time  t; 
salvage  value  of  qu,  in  period  t; 

(l+9y~’C„(yu„  Xuf)=the  total  variable  operating  cost  in  time  period  ( for  resource 
class  < of  vintage  c.  The  function  is  assumed  to  be  pseudoconcave 
in  yuf* 

As  a general  statement,  we  assume  (iiat  economies  of  scale  in  each  time  period  and  technological 
progress  over  time  are  reflected  in  the  investment  cost  functions  Concerning  the  salvage 

value  functions,  we  assume  that  they  are  resource,  vintage,  and  time  specific.  Further,  the  salvage 
value  of  an  asset  is  less  than  (or  equal  to)  its  historical  cost  at  time  of  purchase,  and,  for  a given 
resource,  the  salvage  value  declines  with  age  (vintage). 

The  objective  function  is  to  maximize,  with  respect  to  yfu;  Zx,  ei* J qu„  the  discounted  value 
to  the  firm  given  by 

(3)  0“g{a+p)''  (sr 

-(l+p)-<->  g i?..(2,..))} 

subject  to  the  capacity  constraint 

j I 

(4)  2»fti  •=!,  • • •,  S;  0=1,  . . ., 


*Not«  that  in  the  lequel  analysis  of  (3)  below,  we  assume  that  the  discounted  total  cash  flows  are  also  pseudo- 
concave  functions.  That  i^  let  Ht  be  differentiable  pseudoconcave  functions  K and 

JT  K K 

Af(*)  — ^ for  Then  we  assume,  respectively,  that  Vff(x)'(v—x)  = ^ Hi,(y)  ^ ^ Af»(*)  for  any 

« and  v,  where  v/fs(s)'—(dffi(a)/dsi,  . . .,  djft(s)/dzj. 


C.  H.  KRIEBEL,  A.  A.  ATKINSON  AND  H.  W.  H.  ZIA 


the  non-negativity  constraints 


yy«..>0,  x„>0,  2,„>0;  -Vj,  «,  v,  t, 


the  demand  function  in  (1),  and  the  production  function  in  (2).  Finally,  let  mui  be  the  dual  variable 
associated  with  the  capacity  constraint  in  (4). 

By  assumption,  (3)  is  pseudoconcave  and  (4)  describes  a convex  set.  Consequently,  the  follow- 
ing Kuhn-Tucker  conditions  become  necessary  and  sufficient  for  an  optimum  solution  to  the  pre- 
ceding programming  problem:* 

(6)  ■)<(l+ey—e„’{yi,„  O + fl+plVut,  y«.r>0,  for  t>v,  and  all  j,  t,  a,  v. 

(7)  x„)>S  Mr..,  x.,>0,  for  all  a,  v. 

r— p 

(8)  (l+p)“''"‘’2(.'-l-S  (l+p)“'(l+®)'"'c..*(-,  2...)<  S Mr..,  2...>0,  for  «>»  and  all  t,  a,  v. 

r—t  r»(>r 

t _ 

(9)  z,.— S 2i..^yi...  Mr.,>0  for  all  t,  a,  v. 

T-» 

It  is  understood  for  the  above  that  direct  substitution  is  made  for  the  demand  function  in  (1) 
and  the  production  function  in  (2),  and  that  the  complimentary  slackness  conditions  are  to  hold 
within  each  pair  of  constraints.  That  is,  if  one  holds  with  strict  inequality,  the  other  holds  with 
strict  equality,  and  conversely.! 

4.  OPTIMAL  DECISION  RULES 

In  interpreting  these  results,  we  first  note  some  general  conditions  which  pertain  to  opera- 
tion of  the  facilities  over  time. 

THEOREM  1:  For  every  resource  class  «=1,  . . S 

1.1  Vintages  are  utilized  in  decreasing  order  of  efficiency,  which  is  by  age  with  newest 
capacity  first. 

1.2  In  a period  where  excess  capacity  exists  on  the  newest  vintage  of  the  resource,  no 
output  will  be  scheduled  on  older  vintages,  and  the  dual  variables  n,„  are  zero  for 
all  vintages. 

1.3  Every  vintage  purchased  is  capacity-constrained  in  at  least  one  time  period. 
PROOF:  Suppose  that  more  than  one  vintage  of  a resource  is  available  for  production  at 

time  t where  ...  >1,  and  for  v,,  the  most  efficient  vintage,  assume  that  excess  capacity 

exists;  that  is,  from  (9), 

j 

*For  convenience  in  the  discussion  to  follow,  we  assume  that  the  constraint  set  of  the  problem  has  been  reduced 
such  that,  at  boundary  values,  there  are  no  redundant  binding  constraints.  Hence,  a strict  interpretation  of  com- 
plimentary slackness  is  presumed  valid.  In  general,  without  the  assumption  for  problems  of  this  type,  the  strict 
conditions  may  not  hold. 

t Note  for  x,,,  in  (8)  that  }i.,)  = x„)  by  the  chain  rule. 


By  (6),  this  implies 
(10) 
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Now  suppose  statement  1.1  above  is  false  and  for  c^<0i,  yjut^O.  Substituting  for  Vt  into  (6) 
and  combining  the  result  with  the  condition  for  th  gives 

(11)  (l+*)'-*el.(y>,.,  •)=(l+»)'-'*cS,(y«.,  O+d+p)^. 

Since  inm,  is  strictly  nonnegative,  (11)  implies  the  contradiction 

(12)  (H-»)'-C(y/...  •)>(i+«)‘-cS.(y/..,  •). 

Thus,  the  assumption  on  variable  operating  costs  generates  an  economic  preference  ordering  for 
vintage  utilization. 

Statement  1.2  is  a direct  consequence  of  1.1  and  (9),  since,  for  , >1, 

( 

(13)  yi«=y..<a:ttf.-*p..,=0  for  all  v 
»ndy„=ito„. 

Statement  1.3  follows  from  the  problem  description  and  (7),  since  resource  capacity  is  a 
decision  variable 

for  at  least  one  t=v,  o+l,  . . . Q.E.D. 

The  dual  variable  itt„  may  be  interpreted  as  a quasi-rent  for  the  period  of  the  given  vintage  of  the 
resource,  and  is  defined  in  the  next  theorem. 

THEOREM  2:  The  dual  variable  mm*  is  equal  to  the  present  value  of  the  difference  between 
the  actual  marginal  operating  cost  for  each  vintage  of  a resource  class  « and  the  marginal  operating 
cost  of  the  least  efficient  vintage  of  that  class  employed  in  production  at  less  than  capacity  during 
the  time  period;  i.e.,  for 

(14)  M,..=(H-p)-‘{(i+*)'-*cr..(y«.,  •)-(H-«)‘-c;.(y,..,  •)} 

PROOF:  Suppose  that  in  period  t,  the  total  requirements  of  a given  resource  exceed  vintage 
capacities  down  to  but  not  exceeding  a particular  vintage  c*,  where  t>0i>t^>  . . . >t>*>l. 
That  is,  let 

(13)  y«»~yMi»"i" 

From  (6)  and  Theorem  1,  we  have  for  all  af>o* 

(l«)  y«*(y«.,  •)  Z?„(y«)=(H-«)‘-'‘cS,(y«.,  O-Kl-f,)) W, 

and  for  Vt=v* 

(17)  /«*(y«..  •)2?/.(yi.)-(H-*)'— cr..(y«...  •)• 

Since  (16)  and  (17)  must  hold  simultaneously  for  all  vintages  and  each  resource,  direct  substitution 
yields 

p«..=(H-p)-‘{(i+4)'-cr..(w,  •)-(H-#)‘-‘<*.(y/...  •))} 


aa  stated. 


qj:.d. 
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Regarding  Theorem  2,  the  formal  interpretation  is  valid  only  when  the  least  efficient  vintage 
is  employed  at  less  than  capacity  during  the  time  period,  as  stated.  Under  the  case  where  the 
marginal  vintage  is  also  used  to  capacity,  the  equations  in  (14)  are  underidentified  and  the  alloca- 
tion may  appear  arbitrary,  despite  the  fact  that  the  e*  vintage  can  always  be  identified  for  each 
resource  class  in  any  time  period  on  the  basis  of  marginal  operating  costs.* 

DEFINITION  1 : Define  the  quasi-marginal  cost  (MC,^,)  of  a vintage  resource  (a,  v)  as  the 
sum  of  the  actual  marginal  operating  cost  at  total  output  (MC|„)  and  the  future  worth  at  ( of  the 
quasi-rent, 

(18)  MCS,-(l-|-®)‘-(!,/(y«.,  O+d+p)^. 


COROLLARY  (Theorem  2) : In  each  time  period,  the  quasi-marginal  cost  of  a resource  class  a 
is  the  same  for  all  vintages  of  the  given  resource  employed  in  production. 

PROOF:  Trivial  from  Theorem  2:  MCX,=MC|,,.=MC,*  -F k.  Q.E.D. 

We  now  state  the  optimal  pricing  decision  rule  for  each  job  class. 

THEOREM  3 : In  each  time  period  t, 

3.1  For  every  job,  the  value  of  the  marginal  product  of  each  resource  class  a must  equal 
its  quasi-marginal  cost; 

3.2  The  optimal  price  for  each  job  (p*i)  is  equal  to  the  quasi-marginal  cost  of  the  job 
(MC/,) ; i.e.,  for  some  a, 

(19)  y«.>0-»y«>0-*pJ.=MC;, 

PROOF:  The  first  statement  follows  from  (6)  and  the  preceding  corollary;  viz., 

(20)  •)=MC»4^j. 

It  is  also  true  that  for  each  given  j,  (20)  holds  for  every  a,  and,  further,  for  the  quasi-marginal 
cost  of  a resource  for  job  j, 

(21)  MC;,.=MC,-i 

Multiplying  both  sides  of  (20)  by  dyn,  and  summing  over  all  resources,  we  obtain 

MCJ  dy^i, 

(22)  = dm  =p?. 

^/*«(y«t,  •)#«. 


We  know  that,  for  the  marginal  cost  of  each  job, 

(23)  UCu=dCJdy„ 

and  the  respective  total  differentials  can  be  written  as 


(24) 


dC,  S 


*In  this  case,  one  can  interpret  mm?*  as  approximately  equal  to  aero;  however,  this  informal  interpretatton  is 
notwithstanding  the  formal  conditions  below.  Tliat  is,  we  obtain  exact  identification  for  all  the  aii>  variables  with 
the  addition  of  (36)  below. 
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By  uudogy  from  (24),  we  can  interpret  the  l.h.8.  of  (22)  m the  quen-merginel  cost  of  the  job, 
MC/,.  Computation  of  the  optimal  pricing  rule  can  be  obtained  from  (22)  or  (20)  since 

(M)  pJ.-MC;,-.^^^^^foralls.  Q.E.D. 

We  note  that  the  conditions  in  (20)  and  (24)  permit  the  imputed  allocation  of  costs  (ptioea) 
to  the  reaourees  consumed  by  each  job. 

Turning  our  attention  to  (7)  and  (8),  we  determine  the  optimal  investment  and  replacement 
decision  rulea. 

DEFINITION  2.  The  long-run  marginal  cost  of  resource  capacity  Xu  is  equal  to  the  present 
value  of  the  marginal  investment  cost  plus  the  sum  of  the  discounted  marginal  operating  costs  of 
capacity  in  all  periods  of  use. 

THEOREM  4.  Optimal  investment  and  replacement  decision  rules.  Purchase  Xw  of  resource 
class  « in  time  period  v,  and  sell  (retire)  q,n^x„  in  time  period  (,  (>o,  provided  the  long-run 
marginal  cost  of  resource  capacity  in  the  t—v  periods  of  use  less  the  present  value  of  the  marginal 
return  from  salvage  in  period  t is  equal  to  the  total  quasi-rent  for  the  vintage  over  the  planning 
subhorixon. 

Otherwise,  do  not  invest. 

The  time  frame  r^r<t  constitutes  a natural  planning  subhorizon  of  the  infinite  horizon 
decision  problem. 

(28)  x„>0-*(l-i-a)-‘'-‘>**'-|-2  (l+a)-'(l+#)'-«..*(-,  x«) 

-(l+s)-“-”*./-S  Mm>0,  e-1,  2 ;a-l 8 

and 

(27)  g«,>0-*g,„-»x„,  »<«<•. 


PROOF:  From  (7)  and  (0),  we  have  ^v,  $ 

(n  (l+a)-‘-'’*«*+S  m«.^0-x„50. 

The  marginal  cost  terms  in  tiie  left  inequality  are  strictly  nonnegative  by  assumption.  For  decreasing 
values  of  x*,  equality  is  approached  as  the  successive  long-run  marginal  cost  values  decrease  and 
the  aaaociated  quasi-rent  values  increase.  Unless  equality  occurs  for  some  Xm>0,  (7)  and  (8)  have 
no  physical  meaning.  Therefore,  tiie  firat  relation  in  (7')  must  become  a strict  equality  for  some  v 
and  each  a,  and  the  sum  of  the  quasi-rents  will  be  strictly  positive  by  Theorem  1. 

Now,  over  the  time  frame  e<t£  • we  can  substitute  from  (8)  into  (7')  for  the  quasi-rents; 
simplifying,  we  obtain 

(28)  (l+a)-‘'-"»./;$(H-a)-‘*-”*«*+2  (i+^)  -'a+#)'-'c./(-,  x«)-2  m».. 

t»9 

which  is  a strict  inequality  •Pv,  unless  gM>0  for  some  t>v.  The  discounting  process  guarantees 
that  in  the  limit  as  (-»•,  (28)  must  become  an  equality  by  (7').  To  reach  equality  at  some  (<  •• 
in  (28),  the  salvage  term  (m  the  must  decline  to  a slower  rate  over  time  than  the  r.h.$.  for 
given  Xw;  otherwise,  the  vintage  is  never  sold.  This  implies  that  the  total  quaai-rents  for  the  vintage 
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must  dominate  the  sum  in  all  periods  of  use  of  the  discounted  marginal  operating  costs  of  capacity. 
From  Theorem  2,  when  a vintage  has  become  marginal  and  is  employed  at  less  than  capacity  (or 
is  idle),  Mr«i=0,  a<r=t;  however,  marginal  operating  costs  of  capacity  are  positive  at  all  output 
levels.  Hence,  equality  in  (28)  must  be  obtained  at  or  before  the  period  ( when  the  quasi-rents 
become  zero;  otherwise,  the  inequality  diverges  for  any  0<y,„<z,„  T>t,  in  contradiction  to  (?')• 
The  equality  condition  is  restated  as  (26)  and 

t»<  ». 

The  time  frame  o<r<t  is  thus  a natural  planning  subhorizon  for  z„,  qut  in  the  original  in- 
finite horizon  problem  by  (26).  Moreover,  since  the  discounted  marginal  return  from  salvage  is 
a decreasing  function  of  time,  and  the  quasi-rents  in  (8)  have  been  eliminated  for  all  r>(— 1, 
we  have  (27).  That  is,  whenever  a sale  is  initiated,  all  of  that  vintage  is  sold  (or,  equivalently, 
its  economic  value  is  “written  off”  at  time  t).  Q.E.D. 

From  (26),  we  see  that  the  quasi-rents  iir„  serve  to  explicitly  link  the  market  values  of  the 
outputs  and  the  marginal  costs  of  input  resources;  similarly,  one  can  link  the  costs  of  outputs 
to  the  values  of  imput  resources. 

In  interpreting  the  above,  we  observe  that  the  marginal  operating  cost  of  capacity  (i.e., 
c«>*(‘i  ^n))  can  be  either  positive  or  negative,  depending  on  the  operating  load  on  a given  resource. 
This  element  serves  to  link  the  role  of  production  economies  of  scale  to  the  investment  and  dis- 
posal decisions.  That  is,  for  a low  level  of  output,  the  positive  c„*  induces  a smaller  capacity  in- 
vestment; conversely,  if  the  output  level  is  high,  c,,*  turns  negative,  inducing  a larger  investment 
because  of  savings  in  the  operating  cost  at  the  more  efficient  scale.  Equivalently,  when  output  is 
low,  the  inducement  is  to  dispose  of  "excess  capacity”  sooner;  conversely,  a negative  e,,*  acts  as 
a deterrent  for  the  replacement  decision  because  the  discounted  marginal  salvage  value  must 
match  the  potential  future  savings.  The  natural  planning  subhorizon  can  be  interpreted  in  the 
context  of  a myopic  decision  rule  [1];  in  this  case,  it  provides  an  explicit  analytical  link  to  the 
previous  short-run  model  in  Ref.  [5].  That  is,  it  provides  a finite  planning  subhorizon  within  the 
infinite  time  frame  such  that  following  the  optimal  decision  rules  in  each  siibhorizon  yields  the 
optimal  policy  solution  for  the  infinite  horizon  problem.  Thus,  the  infinite  horizon  problem  can 
be  decomposed  into  finite  subhorizon  problems  whose  solution  is  facilitated  by  the  assessment 
of  a finite  system  of  equations  in  each  case. 

5.  CONCLUSIONS 

In  summary,  this  model  is  able  to  handle  situations  which  previous  papers  have  excluded. 
The  explicit  introduction  of  the  replacement  decision  allows  consideration  of  all  demand  possi- 
bilities between  pure  contraction  and  expansion.  Incorporation  of  the  salvage  value  function 
provides  the  means  to  decouple  the  infinite  horizon  into  natural  planning  horizons  yielding  the 
myopic  rules  previously  unavailable  for  a vintage  type  of  model.  This  analysis  also  generalizes  to 
the  long-term  the  short-run  pricing  model  in  Ref.  [6]. 

In  the  interests  of  brevity  we  have  not  repeated  the  earlier  discussion  concerning  environ- 
mental characteristics  of  computer  services  and  model  implementation,  since  the  comments  in 
Ref.  [6]  obviously  remain  appropriate  in  this  case  as  well.  Beyond  rationalizations,  there  appear  to 
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be  three  general  issues  that  might  be  raised  regarding  the  approach  taken  and  consequently  might 
be  of  interest  for  continuing  research. 

The  first  issue  is  the  question  of  uncertainty  and  its  resolution  in  the  real  world.  Our  bias 
here  is  to  focus  initially  on  the  within-period  uncertainty  case  in  a somewhat  more  formal  way 
than  the  heuristic  approach  outlined  in  Ref.  [5]  and  as  originally  suggested  by  Smidt  [8].  The  goal 
of  this  effort  would  be  to  provide  a model  of  the  stochastic  elements  in  the  environment  that  is 
consistent  with  the  overall  framework.  For  example,  under  reasonable  assumptions,  can  a character- 
ization be  developed  which  links  to  the  intertemporal  model  through  certainty  equivalents  or  the 
moments  of  well-behaved  probability  distributions? 

A second  general  issue  concerns  the  analytical  tractability  of  the  models,  particularly  when 
one  expands  the  framework  to  incorporate  the  within-period  elements  noted  above.  A third  general 
issue  is  the  empirical  determination  of  the  model.  Do  fundamental  problems  exist,  such  as  in  the 
specification  and  estimation  of  the  economic  production  function  of  the  model,  which  preclude  the 
prospects  for  implementation  from  the  outset?* 

Clearly,  these  issues  are  interrelated  and  the  answers  to  the  questions  may  not  be  easy  to 
obtain.  Nevertheless,  we  feel  that  continued  research  is  the  productive  next  step  for  removing 
qualifications. 


BIBLIOGRAPHY 

[1]  Arrow,  K.  J.,  “Optimal  Capital  Policy,  The  Cost  of  Capital,  and  Myopic  Decision  Rules,” 

Annals  of  The  Institute  of  Statistical  Mathematics  16,  21-30,  Tokyo  (1964). 

[2]  Baumol,  William  J.,  “Optimal  Depreciation  Policy:  Pricing  the  Products  of  Durable  Assets,” 

Bell  Journal  of  Economics  and  Management  Science  638-656  (Autumn  1971). 

[3]  Fama,  E.,  and  M.  Miller,  The  Theory  oj  Finance  (Holt,  Rinehart  and  Winston,  1972). 

[41  Kriebel,  C.  H.,  J.  R.  McCredie,  A.  Raviv,  P.  Wolk,  and  H.  Zia,  “Modeling  the  Productivity 
of  Information  Systems,”  Technical  Report  No.  NSF  APR75-20546/76/TR2,  G.S.I.A. 
and  Computation  Center,  Camegie-Mellon  University,  Pittsburgh,  Pa.  (June  1976, 
Revised). 

[5]  Kriebel,  Charles  H.,  and  Osama  Mikhail,  “Dynamic  Pricing  of  Resources  in  Computer  Net- 

works,” 105-124  in  M.  A.  Geisler  (ed.).  Logistics  (North-HoUandlTIMS  Stvdies  in  the 
Management  Sciences),  1 (1975). 

[6]  Littlechild,  S.  C.,  “Marginal  Cost  Pricing  with  Joint  Costs,”  Economic  Journal,  80,  223-35. 

(June  1970). 

[7]  Moring,  H.,  “The  Peak  Load  Problem  with  Increasing  Returns  and  Pricing  Constraints,” 

American  Economic  Review  (Sept.  1970). 

[8]  Pfouts,  R.  W.,  “The  Theory  of  Cost  and  Production  in  the  Multi-Product  Firm,”  Econo- 

metrica  650-658  (October  1961). 

[9]  Smidt,  S.,  “Flexible  Pricing  of  Computer  Service,”  Management  Science  B581-B600  (June 

1968). 

[10]  Thomas,  A.  L.,  The  Allocation  Problem  in  Financial  Accounting  Theory  (American  Accounting 
Assn.,  Evanston,  111.,  1969). 


*Thi8  research  is  currently  in  progress;  e.g.,  see  Ref.  [4|. 


REPLACEMENT  MODELS  UNDER  ADDITIVE  DAMAGE* 


Dror  Zuckermanf 

Cornell  University 
Ithaca,  Nev)  York 


ABSTRACT 

A production  system  which  generates  income  is  subject  to  random  failure. 
Upon  failure,  the  system  is  replaced  by  a new  identical  one  and  the  replacement 
cycles  are  repeated  indefinitely.  In  our  breakdown  model,  shocks  occur  to  the 
system  in  a Poisson  stream.  Each  shock  causes  a random  amount  of  damage,  and 
these  damages  accumulate  additively.  Hir  failure  time  depends  on  the  accumulated 
damage  in  the  s3rstem.  The  income  from  the  s3rstem  and  the  cost  associated  with 
a planned  replacement  depend  on  the  accumulated  damage  in  the  system.  An 
additional  cost  is  incurred  at  each  failure  in  service.  We  allow  a controller  to  replace 
the  system  at  any  stopping  time  T before  failure  time.  We  will  consider  the  problem 
of  sp(«if3ring  a replacement  rule  that  is  optimal  under  the  following  criteria:  maxi- 
mum total  long-run  average  net  income  per  unit  time,  and  maximum  total  long-run 
expected  discounted  net  income.  Our  primary  goal  is  to  introduce  conditions  under 
which  an  optimal  policy  is  a control  limit  policy  and  to  investigate  how  the  optimal 
policy  can  be  obtained.  Examples  will  be  presented  to  illustrate  computational 
procedures. 
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1.  INTRODUCTION  AND  SUMMARY 

A production  system  is  subject  to  a sequence  of  random  shocks  occurring  in  a Poisson  stream 
at  rate  X.  Each  shock  causes  a random  amount  of  damage,  and  these  damages  accumulate  additively. 
The  successive  shock  magnitudes  Pi, Pi,  . . . are  positive,  independent,  identically  distributed 
random  variables  having  a known  distribution  function  F{y).  Any  one  of  the  shocks  might  cause 
the  system  to  fail,  the  probability  of  which  is  a function  of  the  accumulated  damage  caused  by  all 
previous  shocks.  More  explicitly,  if  at  time  t the  cumulative  damage  is  X{t)=x,  and  a shock  of 
magmtude  y occurs,  then  the  system  fails  with  known  probability  l-r(x-l-y)-  The  function  r(-) 
is  referred  to  as  the  survivorship  function.  It  will  be  assumed  that  r{-)  is  a nonincreasing  function 
of  the  cumulative  damage.  Upon  failure,  the  system  must  be  replaced  by  a new  one  having  the 
same  properties,  and  the  replacement  cycles  are  repeated  indefinitely. 

Each  replacement  costs  C{x)  dollars,  where  x is  the  cumulative  d'mage  at  the  time  of  replace- 
ment, and  each  failure  adds  an  additional  cost.  The  mean  rate  of  income,  when  the  system  is  oper- 
ating and  the  cumulative  damage  is  x,  will  be  denoted  by  I{x).  The  act  of  replacing  the  system 
requires  a known  amount  of  time.  Let  r/  be  the  downtime  associated  with  a failure  replacement, 

•This  research  was  supported  in  part  by  the  National  Science  Foundation  under  Grant  Eng-7600-570. 
fNow  at  the  Hebrew  University  of  Jerusalem. 
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and  let  r,  be  the  downtime  associated  with  a planned  replacement.  The  accumulated  damage  in 
the  system  is  observable.  We  allow  a controller  to  institute  a planned  replacement  at  any  stopping 
time  7<i,  where  S is  the  failure  time  of  the  system. 

We  consider  the  problem  of  specifying  a replacement  rule  that  is  optimal  tmder  the  following 
criteria: 

(a)  Maximum  total  long-run  average  net  income  per  unit  time. 

(b)  Maximum  total  long-nm  expected  discounted  net  income. 

Our  primary  goal  is  to  introduce  conditions  under  which  an  optimal  policy  is  a control  limit,  policy, 
and  to  investigate  how  the  optimal  policy  can  be  obtained.  The  term  “control  limit  policy”  refers 
to  a policy  in  which  we  replace  either  upon  failure  or  when  the  accumulated  damage  first  exceeds  a 
critical  control  level  (*. 

Barlow  and  Proschan  [1]  discussed  in  considerable  detail  the  problem  of  finding  an  optimal 
replacement  rule  when  the  time  duration  tiiat  the  ^stem  is  in  service  is  the  only  information  avail- 
able to  the  controller.  Esaiy,  Marshall  and  Proschan  14]  investigate  the  property  of  a breakdown 
model  for  which  the  instants  at  which  damage  to  the  system  occurs  are  Poisson  distributed  in  time 
and  the  magnitude  of  damage  caused  by  each  shock  equals  one.  Taylor  [7]  derives  an  optimal 
replacement  rule  which  maximizes  the  total  long-run  average  net  income  per  unit  time  for  the  case 
when  C(x)  and  I(x)  are  constants  and  the  cumulative  damage  process  is  a compound  Poisson 
process. 

Section  2 treats  the  breakdown  model  imder  the  loug-nm  average  net  income  criterion.  An 
example  will  be  presented  to  illustrate  computational  procedures.  Section  3 considers  the  same 
breakdown  model  under  a discounted  cost  criterion. 

The  following  will  be  standard  notation  used  throughout  the  paper:  .£,[•]=£(•  |Al(0)=*I, 
^»(0=P*(’|Ar(O)=a:),  and  reserve  E (P)  without  affixes  for  expectation  (probability)  conditional 
on  A’(0)=0.  The  notation  E[Y;  A],  where  P is  a random  variable  and  A is  an  event,  refers  to  the 
expectation  E[ljjr\=E{Y\Ij^=l\P{A),  where  is  the  set  characteristic  function  of  A. 


2.  OPTIMAL  REPLACEMENT  UNDER  THE  LONG-RUN  AVERAGE  NET  INCOME 
CRITERION 


Let  [X{t);  0<<<3}  be  the  stochastic  process  described  in  the  previous  section,  interpreted  as 
the  cumulative  damage  process  up  to  the  failure  time  of  some  production  system.  We  allow  a 
controller  to  institute  a planned  replacement  at  any  stopping  time  7<^i.  Upon  failure,  the  system 
must  be  replaced  by  a new  identical  one,  and  the  replacement  cycles  are  repeated  indefinitely.  The 
mean  income  rate  as  a function  of  the  cumulative  damage  in  the  system,  /(x),  is  bounded  over  the 
state  space  of  the  damage  process  and  nonincreasing  in  x.  A failure  replacement,  the  event  { T=S], 
costs  K dollars.  The  cost  of  a planned  replacement  as  a function  of  the  cumulative  damage  at 
replacement  time,  C7(x),  is  assumed  to  be  bounded  and  nondecreasing  in  x,  with  G(0)=0  and 
C(x)  <K  for  every  x contained  in  the  state  space  of  the  damage  process. 

To  obtain  the  long-term  expected  net  income  per  unit  time,  consider  the  renewal  process 
formed  by  repeated  replacements  of  identical  systems.  Theorem  3.16  of  Ross  [5]  implies  that  the 
long-run  average  net  income  per  unit  time  is  the  expected  net  income  over  a replacement  cycle 
divided  by  the  expected  duration  between  replacements.  That  is,  the  average  net  income  associated 
with  a stopping  time  T will  be 


(2.1) 


E [ W#))*]-  E{C{X(X)) ; r<«l-iiCP{r=«} 

* E[T\+r,P{T<i)+T,P{f=t} 
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Let  ^*=31^  be  the  optimal  average  net  income  rate.  Let  and  ^ be  lower  and  upper  bounds 
respectively  for  Also  let 

fl(*)  = /r(x+y)dF(y). 

Consider  the  following  two  cases: 

CAS£  1:  Tf  < Tf,  and 

(2.2)  ,(x)=X(X+^!(r,-r,))(l-«(x)]-X[C(x)-y'C(x+y)r(x  + y)dF(»)]-/(x) 

is  nondecreasing  in  x. 

CASE  2:  T,>T/,  and 

(2.3)  ♦ (x) = X (liL+ rf/tir,-  T,))ll—Rix)]  — \[C (x) — y C(x +y)r{x+y) dF{y)]  — /(x) 
is  nondecreasing  in  x. 

In  this  section,  we  will  show  that  in  both  cases  above,  an  optimal  stopping  time  T*  is  deter- 
mined by  a single  control  value  {*.  The  optimal  strategy  calls  for  replacement  upon  failure  or  when 
the  cumulative  damage  first  exceeds  {*,  whichever  occurs  first;  i.e., 

(2.4)  r*=min{inf  {«>0;X(t)>{*},«}. 

Furthermore,  we  will  show  that  and  rf/*  can  be  determined  by  solving  together 

(2.5)  {•=inf  {x;  ^•-|-X(i:-b^*(r,-r,))(l-ii(x)]-X[C(x) 

-J‘C{x+y)r{x  + y)dF(y)]  -/(x)  >0} 

and 

(2.6)  ^•{Fm  + r,P{r*<«}-Hr,P{r*=«}}-F[J^'"/(A'(«))rf«] 

-i-F[c(X(r*));  r*<«]-i-2iCP{r*=«}=o, 

where  T*  in  (2.6)  has  to  be  represented  as  a fimction  of  (*,  as  will  be  demonstrated  in  Example  1. 

Before  proceeding,  let  us  consider  the  infinitesimal  operator  A of  the  damage  process.  The 
domain  of  A is  the  set  of  bounded  Borel  measurable  functions  / defined  on  [0,  <»).  Let  / be  in  the 
domain  of  A.  Using  the  definition  of  the  operator  A,(x)  (see  Dynkin  [31,  p.  3),  we  obtain 

A,(x)=lim  r«{£;[/(A-(t));  ««]-/(a:)} 

(4.0 

(2.7)  =lim  f"‘{(l— X0/(a:)+Xfy’/(2+l/)>’(x-fy)dF(y)+o(f)— /(*)} 

(4.0 

= -HHx)  - ff(x+y)  r {x+y)dF(y)]. 

A key  tool  for  us  is  the  following  formula: 

(2.8)  EJ[J(XiT)y,  T<6]-nx)=E,  A,(A'(«))<i«]( 

valid  for  any  / in  the  domain  of  A and  any  stopping  time  T having  finite  expectation  (see  Theorem 
5.1  and  its  corollary  in  Dynkin  [3]). 

To  begin  the  derivation,  note  that  for  every  stopping  time  T 

’’  I (X(t))  d«]-  EIC(X(D) ; r<«]-KP{T=«} 

ElT]+t,P{  T<S}-hr,P{T=t} 
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A stopping  time  T maximizes  the  long-run  average  net  income  rate  if  and  only  if  T minimizes 
Or,  where 

(2.9) 

er^{r-i>T){E[T]+r,P{T<S}  +r,P{T=B} } 

and  the  minimum  value  of  Or  is  zero.  By  the  strong  Markov  property  we  have 

(2.10)  E[T]=E[i]-E[&-T-,  r<a]=W(o)-£:[Tr(A’(r)):  T<i] 


where 

W(x)=B.[5]. 

Using  (2.10),  (2.9)  can  be  rearranged  to  give 

(2.11)  Or=rr,-E^^^ I{X{8))ds'^+{Ey{X{T)y,  T<6]-m}, 

wlXGFO 

j{x)=-4>*W{x)+C{x)-rr,+^*r,-K. 

Since  W{x)  and  C(x)  are  bounded  as  functions  of  x (Taylor  (7]  proved  that  W{x)  is  bounded 
as  a function  of  x),  we  may  apply  formula  (2.8)  to  yield 

A,(X(s))(fe]- 

By  applying  a standard  renewal  argument  the  following  is  obtained: 

(2.12)  ir(x)= J-l-  fW{x-\-y)  r {x-\-y)dF{y). 

Using  equations  (2.7)  and  (2.12),  we  obtain 

A,(x) =^*-X  [U(x)  - fC{x+y)  r {x+y)  dF{y)]+\ (iiC-t-^*(r,-r,))  (1  -fi(x)l. 


We  can  now  write 

(2.13)  0T=^*r,+E^^  J{X(,8y)d8^> 
where  J(x)  s A,{x)  — /(x) . 

Our  goal  is  to  minimize  Or  as  a function  of  T.  First  consider  the  case  T,<Tf.  J{x)  can  be  re- 
arranged to  give 

(2.14)  J(x)=,(x)-1-X(^*-^J)  (r,-r,)(l-fl(x)l-i-^*, 
where  ti{x)  is  given  by  (2.2). 

Assume  that  q(x)  is  nondecreasing.  It  can  then  be  seen  that  J{x)  is  also  nondecreasing  in  x. 
Now,  since  f*=inf  {x;  J{x)  >0}  (see  (2.5)),  J(x)  >0  if  and  only  if  x>£*.  Thus,  by  definition  of  T* 
(see  (2.4)),  we  obtain  for  all  <<5 


J (X{t))<C.O  if  and  only  if  t<C.T*. 
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For  every  stopping  time  T, 


(2.15) 


J{Xis))d8'j-E  J(X(«))<fe] 

j{X(8))d8;  r>r*J<o. 


Thus,  T*  minimizes  dr  and  (2.6)  expresses  the  boundary  condition  tfr.=0.  This  completes  the  proof 
of  the  optimality  of  T*  in  Case  1.  By  repeating  the  same  argument,  we  can  easily  establish  the 
optimality  of  T*  in  Case  2. 

REMARKS: 

1.  The  optimal  critical  level  {*  is  nonincreasing  as  a function  of  (.Tf—r,). 

2.  The  main  result  of  Taylor  [7]  can  be  obtained  as  a special  case  from  equations  (2.4), 
(2.5),  and  (2.6). 

3.  If  C{x)  is  a constant  and  t/>t„  an  optimal  policy  is  a control  limit  policy. 

4.  In  some  cases,  it  is  possible  to  expicss 

£:[J^'’*/(A'(«))d«] 


analytically  as  a function  of  (see  (2.6)),  as  will  be  demonstrated  in  Example  1; 
otherwise,  simulation  methods  are  needed. 

5.  It  turns  out  that  even  if  /(z)  is  nonincreasing  and  C(x)  is  nondecreasing,  an  optimal 
policy  is  not  necessarily  a control  limit  policy.  To  be  more  specific  let  us  consider  the 
following  example  (the  downtimes  associated  with  a planned  replacement  and  with  a 
failure  replacement  are  negligible) : 


r(2)  = 


1 if  2<3, 
,0  if  2 > 3. 


E(.y)=0  for  y<l,  1/2  for  1 <y<2  and  1 for  y>2. 


K=200,  X=l,  and  /(z)=1000. 


C(x)= 


0 for  z=0, 

1 for0<z<l, 
1+199(2-1)  for  l<x<2. 


[200  for  z>2. 


The  state  space  of  the  damage  process  contains  the  states  {0,  1,  2}.  The  theory  of  optimal 
stopping  in  Markov  processes  (Theorem  3 of  Taylor  [6])  implies  that  the  optimal  policy  is  the  best 
one  among  the  following  set  of  policies : 

Vd)— replace  at  z=l,  or  at  failure  point  of  time. 

•’(I)— replace  only  at  failure  time. 

Vd)— replace  at  z=2,  or  at  failure  time. 

■•(4)— replace  at  2=1,  2. 

It  can  be  seen  that  the  policy  ir(i)  is  the  optimal  policy,  but  clearly  it  is  not  a control  limit 
policy. 
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EXAMPLE  1 : Let  us  consider  the  following  case : 

1 if  2<£, 

r(z)  = 

P if  z>L, 
■ C(x)=ax, 

I(x)=b-2  X, 


F{y)=l-e-‘‘o, 

K=2aL, 

r,<r/. 


(a>0), 

(6>0), 

(y>0), 


To  avoid  trivialities,  we  assume  that  the  optimal  net  income  rate  is  positive.  In  view  of  this,  we 
can  choose  0 as  a lower  bound  for  Also  suppose  that  L>l/it,  which  is  a reasonable  assumption. 
In  order  to  show  that  an  optimal  policy  is  a control  limit  policy,  it  suffices  to  verify  monotonicity 
of  ti(x)  (see  (2.2)).  In  our  case, 

(2.16)  i,(x)=2aIX  dF{y)-\  a(x+y)dF(y)]-(6- j x) 

M It  L 


Since  L>  1/m,  the  optimal  policy  is  a control  linvit  policy  and  ({*,  ^*)  can  be  determined  by  jointly 
solving  equations  (2.5)  and  (2.6).  Clearly,  {*<£.  Let  N be  the  index  of  shock  at  which  failure 
occius.  In  order  to  express  the  expected  income  per  cycle  under  the  optimal  policy  as  a function  of 
(*,  we  wUl  introduce  the  substochastic  kemal 

(2.17)  K^(()=P{N>n  and  F,+r.+  . . . +r.<{} 

^p-’d)  forKA 

°'1p’<->(L)  for{>A 

where  is  the  K-iold  convolution  of  F.  Here,  F is  exponentially  distributed  with  parameter 
Ml  and  so  for  t<L 

(2.18) 

Since  each  n<^N  has  an  associated  intershock  mean  time  of  X~',  we  obtain 

(2.19)  /(X(a))d.J=X-'  j/(0)+g  I{z)K*(dz)^ 

« 

=X-'  j6+6Mf*-|£({*)‘j- 

The  expected  time  between  two  successive  replacements  under  the  optimal  policy  will  be 

(2.20)  E(r*l=x-  {H-gP{iV>n,  and  F,+  F,+  . . . +F.<f}) 

=X-'  {l+g  ii?(f)}=X-‘{l+Mf}. 


t 
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By  using  the  memoryless  property  of  the  exponential  distribution  we  obtain 

(2.21)  P{r*=j}=e-i'»-i*) 

and 

(2.22)  E[C^X(.T*));  T*<S]=£~^  a(y+f)M«-'‘»  dy 

Thus,  equations  (2.5)  and  (2.6)  reduce  in  this  case  to 

(2.23)  {•4.^*x(t,-t,)«-<-«-*'>=0 

M ft  L 

and 

(2.24)  (l+fi{*)+^*r,(l-«-<'»-n)+»*T,«-<-»-l*> 

The  optimal  strategy  can  be  determined  by  jointly  solving  equations  (2.23)  and  (2.24). 

3.  OPTIMAL  REPLACEMENT  IN  THE  DISCOUNTED  CASE 

Wo  now  attempt  to  maximize  the  expected  total  discounted  net  income.  For  a given  stopping 
time  T,  the  expected  discounted  net  income  from  the  first  replacement  cycle  is 

(3.1)  T<i]~E[e-~^K)  T=i\.  | 

Generally,  the  expected  discounted  not  income  from  the  n"*  replacement  cycle  is  I 

(3.2)  V,,.{n)=UT..{X){e-”»E[e-‘*-,T<i]+e-"'E[e-‘^‘,T=i\Y-\ 

Clearly,  we  can  restrict  our  attention  to  the  following  set  of  stopping  times;  I 

r={T-,X{T)^0}. 

For  each  stopping  time  T*  T,  I 

^(r]>i>o.  j 

i 

Therefore,  the  expected  total  discounted  net  income  associated  with  a stopping  time  TcT  I 

will  be  I 

^ lim  [discounted  net  income  associated  with  the  t*"  I"!  i 

Lii-»  m |replRcement  cycle  when  a stopping  time  T is  employed.]  J | 

By  Applying  the  dominated  convergence  theorem,  it  foUows  that  | 

^ C^r.«(») 

^ A 


5M 

(8.8) 
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/(jr(*))(i*]-E(*-'(7(jr(r));  T<t[-E[»-‘^K\  r-i) 

r<l]— •—'JBl*--*’;  r-«J 


L»t  TJiwmv^  Vx.m  be  the  mexunal  total  diaoounted  net  income.  Let  UJ"  and  be  lower  and 
upper  bounda,  reapeotiyely,  for  XJi.  Conaider  the  following  two  oaaea: 

CASE  1; 

1. 

2.  C{x)  and  r(x)  are  eontinuoua  functions  over  the  state  space  of  the  damage  process. 

3.  r(»)-X(K+tf.*(s--'»-s-"/))[l-if(*))-X«7{*)-/C(*+y)r(*+v)dF(y)]-/(*)-«C(*) 
is  nondeoreaaing  in  x. 

CASE  8: 

1.  r,>r,. 

2.  C(x)  and  r(x)  are  continuous  functions  over  the  state  space  of  the  damage  process. 

3. »(x)-X(Jr+C^.‘'(s--'*-s-«/))(l-/f(x)l-X((7(x)-/C7(x+y)r(x+y)df(y)l-/(x)-a<7(x) 
is  nondecieasing  in  x. 

In  both  of  the  above  cases,  an  optimal  stopping  time  T*  is  determined  by  a single  control 
value  (S.  The  optimal  strategy  calls  for  replacement  upon  failure  or  when  the  cumulative  damage 
first  exceeds  (t,  whichever  occurs  first.  That  is, 

r*-min  {inf  {t^O;  »]. 

Furtheirmore, 

(3.4)  «-inf{x;X(X+t/:(s-'.-s— /))[l-fi(x)] 

— X|C(x) — /^(x+y)  r (x+y)  dF(y)]—I{x)  —a  C{x)  +«  W*""'  ^0} . 

The  above  can  be  proved  by  applying  the  following  formula; 

(3.8)  E,ls-7(^(r))i-/(x)-fi;[J]’’s— (AA.y(«))-«/(Ji:(«)))*] 

wh«e 

(8.6)  AXx)-lim  t-‘{&(/(-y(0)l-/(x)}. 

ti0 

Formula  (3.5)  is  valid  for  any  function/  such  that/(x)  and  A/(x)  are  bounded  and  continuous  (see 
Breiman  (2],  p.  376).  Note  that  equation  (3.6)  reduces  to  equation  (2.8)  when  a-xO. 

The  proof  of  the  above  described  results  follows  a procedure  similar  to  that  used  in  Section  2 
and  therefore  is  omitted. 

It  is  interesting  to  examine  the  connection  between  the  discounted  case  and  the  undiscountad 
case  when  m ^>proaehes  aero.  For  a given  stopping  time  7 • T we  have  (see  (3.3)) 


lima 

•ts 


E[f’’/(.X((*))*]-ElWr));7<«l-KP{r-l} 

j^Ti+rJ»{r<«}+r,P{r-i} 


and  this  last  expression  is  exactly  the  total  long-run  average  net  income  when  7 is  employed. 
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EXAMPLE  2:  To  illustrste  computational  procadures,  let  us  conaiuer  the  following  simple 
model: 

flifO£a<Z. 

r(s)— 

loif  M'tL. 

/(*)-/. 

C{x)^C, 

X-f7+a,  (o>0), 

I7/-0. 


Assume  that  F is  exponentially  distributed  with  parameter  m*  It  can  be  seen  that  the  above  model 
satisfies  the  conditions  in  Case  1 and  as  a result,  an  optimal  policy  is  a control  limit  poU^.  It 
sufilces  to  restrict  attention  to  stopping  times  of  the  form 

Ti-minfinfft^OjXW^f},*},  (t<L). 

First  note  that  for  every 
Let 

Ar({)-infjjir;§‘r.^{[. 

l^tBt  be  the  event  K-^O,  1,2 6? be  the  distribution  function  of  an  exponential 

random  variable  with  parameter  X,  and  be  the  i&fold  convolution  of  Q,  Then 


(3.7)  ^r,(t)-P{r,^t)-gP{E*}0‘«+»(t)-g  (?<*+“«); 

hence, 

(8.8)  El.-'iJ- e-dFr^ {t)-4{a)  g {♦(«) }* 

where  4 is  the  Laplace  transform  of  0 and  Q(4M)  is  the  generating  function  of  a Poisson  distribu- 
tion with  parameter  /i(  at  the  point  S(a).  It  is  well  known  that  ♦(a)a«X/X+a,  and 
therefore, 

(8.8) 


As  a result  of  the  special  structure  of  the  survivorship  function  and  the  memoryless  property  of  the 
exponential  distribution,  it  can  easily  be  seen  that  7||7|<l,  7||7|-*l  and  7(  are  idoitically  dis- 
tributed. Thus,  equation  (3.3)  reduces  in  this  case  to  simply 


rr  « 

t/r,..- 

1-fa- 


(3.10) 


D.  ZUCKERMAN 


In  order  to  find  the  optimal  policy,  we  have  simply  to  minimize  I7r|.>  for  0<i<L. 

REMARK:  In  some  cases,  it  is  impossible  to  express  UT(.m  analytically  as  a ftmction  of  t, 
and  simulation  methods  are  needed.  ^ 
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ABSTRACT 


This  paper  studies  the  one-period,  general  network  distribution  problem  with 
linear  costs.  The  approach  is  to  decompose  the  problem  into  a transportation 
problem  that  represents  a stocking  decision,  and  into  decoupled  newsboy  problems 
that  represent  the  realization  of  demand  with  the  usual  associated  holding  and 
shortage  costs.  This  approach  leads  to  a characterization  of  optimal  policies  in 
terms  of  the  dual  of  the  transportation  problem.  This  method  is  not  directly  suitable 
for  the  solution  for  large  problems,  but  the  exact  solution  for  small  problems  can 
be  obtained.  For  the  numercial  solutions  of  large  problems,  the  problem  has  been 
formulated  as  a linear  program  with  column  generation.  This  latter  approach 
is  quite  robust  in  the  sense  that  it  is  easily  extended  to  incorporate  capacity  con- 
straints and  the  multiproduct  case. 


THE  PBOBLEM 

The  problem  can  be  stated  as 

^ ^ ♦<(«<).  t=l.  2, . . .,  n 

j=l,  2, . . .,  n, 


(1) 


where 

(la) 

subject  to 
(2a) 

(2b) 


(a,-()yi(()d(-hCu<  fj  ({-«.)/.({) d{, 


*11—^  Xij,  -Vf 
*0  Oi,  Xif>0,  4^1,, 


♦This  work  was  performed  under  ONR  Contract  N00014-67-A-0204-0076  on  Multilevel  Logistics  Organization 
Models  at  the  Massachusetts  Institute  of  Technology. 
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In  this  paper  we  consider  the  single  product,  single  period,  multilocation  inventory  problem 
with  stochastic  demands  and  transshipment  between  locations.  This  problem  was  posed  and 
investigated  by  Gross  (4),  where  exact  solutions  were  obtained  for  the  one  location  and  two  location 
cases.  Gross’  method  of  solution  rapidly  becomes  complicated  to  the  point  of  intractability  as  the 
number  of  locations  increases,  and  Gross  suggests  that  search  techniques  be  used  to  obtain  numerical 
solutions  for  larger  problems. 

In  an  early  paper,  Allen  [1]  discussed  the  problem  of  stock  redistribution.  His  model  is  a 
special  case  of  the  more  general  situation  dealt  with  here.  Krishnan  and  Rao  in  Ref.  [9]  have 
tackled  a one-period  nroblem  similar  to  that  proposed  by  Gross.  However,  while  Gross’  formulation 
considered  ordering  md  -hipping  decisions  made  simultaneously  at  the  start  of  the  period,  the 
approach  here  was  to  determine  optimal  ordering  decisions,  given  that  transshipment  decisions 
could  be  deferred  till  demand  was  realized.  An  additional  simplification  made  in  this  paper  was  to 
assume  that  all  transshipment  costs  are  equal.  This  allows  arbitrary  partitioning  of  the  locations 
into  groups,  with  the  same  transportation  cost  still  holding  between  any  two  groups. 

This  problem  has  been  examined  in  Elmaghraby  [3]  and  Williams  [12]  using  a stochastic 
programming  type  of  approach.  We  will  here  adopt  essentially  the  same  viewpoint  and  demonstrate 
the  connection  with  Gross’  solution.  The  notation  used  here  is  that  of  Gross  to  facilitate  comparison 
of  the  re.sults.  Das  [2]  has  studied  a similar  problem.  However,  his  investigation  was  limited  to  a 
specific  two-location  problem. 

In  the  following  sections,  a decomposition  methodology  is  developed  and  applied  to  the 
two-location  problem  as  an  illustrative  case.  The  simplicity  of  the  approach  allows  several  cases  of 
special  two-location  problems  to  be  investigated  easily  and,  in  each  case,  the  complete  form  of  the 
optimal  policy  for  all  initial  stock  conditions  can  be  exhibited  (Karmarkar  [6]). 

where 

5 <«= Initial  inventory  position  at  warehouse  i, 

Oi=Quantity  ordered  from  central  location  for  warehouse  i, 
x<>=Quantity  transshipped  from  warehouse  i to  warehouse  j, 

Si=Total  quantity  stocked  at  warehouse  i after  ordering  and  transshipment, 
P«=Variable  cost  of  ordering  for  warehouse  i from  central  location  ($/unit), 
c«^=’rransshipment  cost  from  warehouse  i to  warehouse  j ($/Unit), 

<7oi=Cost  of  overstocking  at  warehouse  i ($/unit), 

C'»<=Cofit  of  understocking  at  warehouse  i ($/unit), 

/<(0=Density  (p.d.f.)  of  demand  at  warehouse  i. 

The  assumptions  made  in  this  model  are  as  follows: 

(a)  The  demand  at  each  warehouse  is  a random  variable  characterized  by  a continuous 
density  function. 

(b)  Delivery  of  stock  is  immediate. 

(c)  Setup  costs  of  ordering  and  transshipping  are  negligible. 

(d)  Inventory  cannot  be  disposed  of  or  salvaged.  (This  assumption  is  easily  relaxed.) 

(e)  Purchasing,  transshipping,  holding,  and  shortage  costs  are  all  linear.  With  respect  to 
the  last  two,  we  may  make  the  weaker  assumption  that  the  one-period  costs  are 
convex. 


N-LOCATION  DISTRIBUTION 


561 


(f)  There  are  no  capacity  restrictions  on  warehouses. 

(g)  There  is  no  restriction  on  the  amount  of  supply  available  from  the  central  location. 
It  is  shown  elsewhere  [6]  that  assumptions  (f)  and  (g)  can  be  relaxed. 

It  will  be  assumed  throughout  this  article  that  for  any  combination  of  activities  that  is  considered, 
the  marginal  cost  of  supply  rt  to  any  location  i is  less  than  the  backlog  cost  Cm  at  that  location ; 
furthermore,  ‘Wi'>—Coi.  This  assumption  is  necessary  for  solutions  to  exist,  and  is  discussed  rigor- 
ously by  Williams  [13]  and  Earmarkar  [6]. 

It  is  also  assumed  in  [4]  that 


i)  ci,=e,i 
ii)  Vi+Cu>Pi, 


The  last  restriction  is  the  so-called  "triangular  restriction”  which  says  that  it  is  always  cheaper 
to  order  directly  at  any  location,  rather  than  to  order  at  another  location  and  transship.  It  turns 
out  that  this  assumption  really  leads  to  the  most  general  case  in  terms  of  the  number  of  different 
optimal  policies  involved,  and  that  the  assumption  can  be  relaxed  with  no  change  in  the  approach 
[6]. 

Let  us  first  write  down  the  Kuhn-Tucker  conditions  for  the  problem.  We  note  that  since  the 
problem  involves  minimizing  a convex  function  over  a convex  set,  these  conditions  are  necessary 
and  sufiScient. 


(3a) 

i)  «:(»f)+Xf>0, 

-Pi 

(3b) 

P«-Xf>0, 

-Pi 

(3c) 

Co+X?-Xj>0, 

i^j 

(4a) 

ii)  [«'(«?)+'7]«r=0. 

-Pi 

(4b) 

[p<-X?]0?=0, 

-Pi 

(4c) 

[c<j-|-Xy  X*]a;*y=0, 

(5a) 

iii)  «*-«  "-0?+^  a:Tr 

=0, 

4^1 


(5b) 


«f,  0?,  *r/>o. 


We  now  reformulate  the  problem  by  separating  the  objective  function  into  a linear  part  (LP) 
and  a nonlinear  stochastic  part  (NLP).  The  nonlinear  part  can  be  decoupled  into  indejendent 
newsboy  problems  and  the  linear  part  forms  a transshipment  problem  (with  the  triangular  restric- 
tion, this  is  a transportation  problem). 


NLP: 


(6) 


LP: 


Min7,«,-|-«i(«,) 

(•<1 

«i>0 


i=li  2 n. 


(7) 
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subject  to : 

(6a)  ^ ^ 

Of,  Xit^O. 

Now  we  can  write  the  Kuhn-Tucker  conditions  for  each  of  this  pair  of  problems  and  show 
that,  under  certain  global  restrictions,  they  are  equivalent  to  the  optimality  conditions  for  the 
original  problem. 

NLP: 


(8a) 

«:(«?) +7i>0,  -Pi 

(8b) 

i«;(8j)-i-7,]»r=o, 

(8c) 

St>0,  -Pi. 

LP: 

i) 

Primal  Constraint: 

(9) 

s,-s,o+Ot+^  ^?.=0,  -Pi 

ii) 

Dual  Constraints: 

(10a) 

r*  <pi,  -Pi 

(10b) 

iii) 

Complementary  Slackness: 

(11a) 

[?«—»*)  0?=0,  -Pi 

(11b) 

— v*]X*f=0,  -Pit  J, 

(11c) 

ot,  xt,>0. 

Now  if  we  add  to  these  conditions  the  requirements  that  yi=ir*  and  8i=s*,  we  see  that  these 
conditions  are  identical  to  the  Kuhn-Tucker  conditions  for  the  original  problem.  This  immediately 
suggests  that  a naive  algorithm  for  solving  the  problem  might  consist  of  alternatively  solving  the 
two  subproblems;  i.e.,  using  the  NLP  to  generate  targer  inventory  levels  «,  for  the  LP,  and  letting 
the  LP  determine  shadow  prices  «■«  that  are  then  used  in  the  NLP  as  the  marginal  cost  of  purchase 
*■«.  It  appears  that  such  a procedure  will  not  in  general  converge,  but  suggests  computational 
methods  that  are  discussed  elsewhere  [6,  7]. 

Let  us  draw  attention  to  the  dual  of  the  transportation  subproblem.  We  can  write  the  dual 
as 

DLP 

(12) 

subject  to 


Max  S (8t—»i°)ri 


(><l 
«•<<?«! 


-Pt 
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We  see  that  the  dual  has  a very  simple  structiuw,  as  might  be  expected.  Interpreting  as  the 
marginal  cost  of  providing  an  extra  unit  at  the  t**  location  gives  the  dual  constraints  an  intuitive 
meaning.  The  first  set  of  constraints  says  that  the  marginal  price  should  be  less  than  or  equal  to 
the  variable  cost  of  ordering  from  the  central  loction.  The  second  set  says  that  the  cost  of  providing 
an  extra  umt  at  the  location  should  be  less  than  or  equal  to  the  cost  of  providing  that  unit  at 
the  1“  location  and  then  transshipping  it  to  the  j'*‘  location  at  a cost  c«. 

To  obtain  some  insight  into  the  structure  of  the  problem,  we  now  examine  the  two-location 
problem  in  some  detail. 


THE  TWO-LOCATION  PROBLEM 


Let  us  assume  for  the  sake  of  simplicity  and  without  loss  of  generality  that  the  transshipment 
costs  between  the  two  locations  are  equal  in  either  direction.  We  will  also  assume  that  the  “tri- 
angular restriction”  is  operative,  so  that 


P\<p*+c 

Pt<Pi-\-e 


Now  we  can  visualize  the  problem  as  setting  target  stock  levels  «,  and  «i  so  as  to  minimize  the  total 
costs  of  transshipment  and  subsequent  realization  of  demand.  We  have  to  set  target  stock  levels 
such  that  the  total  stock  in  the  system  is  not  less  than  the  starting  stock,  since  we  cannot  dispose 
of  any  stock.  Once  the  target  stock  levels  are  picked,  we  may  think  of  the  ordering  and  transship- 
ment decisions  as  being  made  "automatically”  by  the  LP  subproblem.  Thus,  if  we  parametrize 
the  subproblem  by  the  target  stock  vector,  wo  can  rewrite  the  original  problem  in  terms  of  just  the 
target  stock  levels  as  the  decision  variables. 


where 


Min  «;(«,«,) =z(«,, 

•1p  «} 


subject  to 


«,>0, 


2 (*1,  «»)=Min  pi0|+p»0|-|-ciri»-|-ca:*, 

Oi.  Ot 
Xu 


Pf 


subject  to 


0i— zii  +xii=«i— «i® 


*11.  Xii,  Ot,  0»>0. 
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Thus,  we  now  solve  the  problem  Mt,  where  the  ordering  and  transshipment  cost  function  z(8i,  «i)  is 
characterized  as  the  optimal  value  of  the  subproblem  Pt.  We  note  that  we  have  retained  the  con- 
straints in  Mt  in  order  to  ensure  the  existence  of  a feasible  solution  to  subproblem  Pi. 

We  shall  now  obtain  this  transshipment  cost  function  explicitly  by  considering  the  dual  of  the 
problem  Pt: 

(D)  z(»„«i)=Max  («i— «i°)vi+(»i— «i®)vi 

— »i+vi<c 

vi— vt  <c 
Vi  <Pl 

The  feasible  region  to  this  program  is  shown  in  Figure  1.  Given  a target  stock  vector  8=(8i,  8i), 
we  wish  to  find  a feasible  pair  (vi,  n)  so  as  to  maximize  the  objective  function,  that  is  to  maximize 
the  projection  on  the  gradient  vector  of  the  dual  objective  function. 

We  can  see  by  direct  inspection  of  the  dual  feasible  region  that 

(a)  For  «i— «i®>0,  «i— «i'’>0,  (pi,  pt)  is  the  optimal  point  and 

2 («,,  «i)  = Pi  («j— «i®)  -f  Pi  («i— «1®) . 

(b)  For  «i— *1— «i®-|-«i— «i®>0,  (pi— c,  pi)  is  the  optimal  point  and 

2(«„  «i)=(pi— c)  («|— «i°)+pi(«l— «"). 

(c)  For  «i— «i®<0,  «i— «®-|-«i— «i®>0,  (pi,  pi— c)  is  the  optimal  point  and  . 

2(«1,  8i)=Pl(9t-8°)  + (Pt-e)i8t-8P). 


Fiourz  1.  The  two-location  case:  feasible  region  for  the  dual  problem. 
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These  represent  the  extreme  points  of  the  feasible  region,  and  we  get  an  expresdon  for  the  objective 
function  in  each  case,  in  terms  of  («i,  «i).  We  can  also  say  something  about  the  cases  where  the 
objective  function  is  parallel  to  an  edge  of  tiie  feasible  region: 

(d)  For  «i-s»®=0,  S|— s»®>0,  we  have  vt=pi,  i>»-c<vi<pi. 

(e)  For  «i-«®>0,  s»— **"==0,  we  have  »i=pi,  pi-c<w,<p,. 

(/)  For  «i-si®+s»-st®=0,  ai-*,®<0,  ^-«i®>0,  we  have  Vf-iri=c,  »i<pi-c,  vt<pt- 

(g)  Fore,-«i®+St— i»®=0,  8|— »i®>0,  *,-Si®<0,  we  have  »i— ti=c,  vi<pi,  v»<pi— c. 

{h)  For  *1— s ®=0  and  s*— Si®=0,  any  feasible  w is  optimal,  and  finally 

(t)  For  («i— «i®+*t— «a®)<0,  we  have  an  unbounded  solution  to  (D)  corresponding  to 
an  infeasible  primal. 

We  can  represent  this  information  about  the  dual  variable  values  on  the  (t|,  St)  plane.  This 
has  been  shown  in  Figure  2.  The  feasible  region  for  the  original  problem  M is  the  portion  of  the 
(•ii  *1)  plane  such  that  •i+ai>Si*+<t*i  Si>  Si^O.  The  regions  of  the  boundary  of  the  dual  feasible 
region  correspond  to  regions  in  the  (si,  »t)  plane  and  have  been  marked  correspondingly  with  the 
appropriate  (vi,  v«)  values  shown. 

One  method  of  explicitly  determining  these  optimal  policies  is  to  visualize  the  problem  at 
starting  point  0 in  Figure  2 with  a stock  position  («i^  St*)  and  to  move  away  from  this  point  in  a 
feasible  direction  to  a new  stock  level  that  minimizes  the  total  cost  tD(<i,  <»).  The  point  0 communi- 
cates  with  all  seven  regions,  and  it  is  worth  moving  away  from  it  in  the  direction  of  one  of  these 
regions  if  the  cost  in  that  direction  is  decreasing.  We  can  clearly  use  gradient  arguments  to  char- 


nOURE  3.  The  two-looatioii  ease:  polioy  reglona  in  the  (si,  ii)  plane. 
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acterize  the  policies,  especially  when  we  note  that  the  derivative  of  the  cost  function  in  the  direction 
of  one  of  the  axes  (where  it  exists)  is  independent  of  the  other  variable;  that  is, 

Vw(«i, «»)  = +«-i,  ^(«i) + v»), 

where  T|  and  vj  depend  on  the  region  in  which  Vw  is  evaluated.  We  may  also  note  that 

(a)  For  «i>Si®,  T,=pi;  Si<«i®,  vi=Ps— c. 

(b)  For  V2=l>2;  S2<Sj®i  v2=Pi— c. 

It  is  thought  that  this  approach  will  prove  useful  in  larger  problems  in  making  numerical 
computations.  However,  to  determine  the  exact  nature  of  the  optimal  policies  in  this  case,  it  is 
simplest  to  directly  apply  the  Euhn-Tucker  optimality  conditions  to  each  region,  noting  that 
the  associated  values  of  the  dual  variables  already  satisfy  conditions  (9)  through  (lie).  So  we  have 
to  consider  only  conditions  (8o)  — (8e)  which  effectively  imply  that  either  s<=0  or  ^('(«<)  + «-<=0. 
We  will  assume  merely  for  convenience  that  St^O,  although  the  reader  may  keep  in  mind  that  an 
optimal  stock  level  of  zero  is  a theoretical  possibility.  Also  for  notational  ease,  we  define  y=«*(p) 
to  be  the  solution  of 

My)+p=0- 

We  note  that  since  ^i(')  is  convex,  ^K-)  is  a nondecreasing  function  of  its  argument,  and  it  thus 
follows  that  «*(p)  is  a nondecreasing  function  of  p.  (This  simply  says  that  as  the  marginal  cost  of 
supplying  a location  increases,  the  optimal  stock  level  at  that  location  decreases.) 

Now  applying  the  K— T conditions  to  each  of  the  regions,  we  have 
REGION  1:  The  region  is  defined  by  «2>S2®- 

Furthermore,  we  have 

^'i(«i)+Pi=0;  ♦'2(«2)+P2=0. 

Therefore,  the  optimal  policy  is 

S?  = «?(pi);  8^=8t(pi), 

and  this  policy  applies  when 

«i®<«f(pi);  S2®<«J(P2)- 

REGION  2:  We  have  ^J(®?)  + (p2— c)=0;  ^(«2)+P2=0, 
which  imply  the  optimal  policy 

«?  = «T(P2-C);  S2=S2(P2). 

This  policy  is  optimal  for  starting  stock  (®i®,  82®)  such  that 

«l®>«f(P2-c);  8j«<8j(p,) 

81  ® + 82®  <8?  (p2  - c)  + 8 J (P2) 

REGION  3:  As  for  region  2,  ^((8j)+pi=0;  0i(8j)+(pi— c)=0. 

The  optimal  policy  is 

8?  = 8?(p,);  8;=  8j(pi-c). 

This  policy  applies  if 

8i®<8f  (p,)  ; 8,®>8j(pi  -C) 
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REGION  4;  We  have  and  since  4»(««)+Pi=0,  we  have 

Now  we  know  that  the  optimal  value  of  the  dual  variable  vi  must  be  such  that  Pi— c<vi<j>i. 
Hence,  this  policy  will  apply  when 

♦;(«i*)+vi=o; 

REGION  6:  As  in  region  4,  the  optimal  policy  is 

«?=*i(pi);  *•=»••. 

This  policy  applies  when 

»4*<«?(i>i) ; <i(pi)  -e) 

and  the  optimal  value  of  the  dual  variable  is  given  by 

4i(*i*)+vi=0. 

REGION  6:  The  region  consists  of  points  in  the  feasible  region  such  that 

For  the  dual  variables,  we  have  that 

vi<P»— c,  v*<p»;  vi=c, 

and  we  have  the  conditions 

4.'(*r)+».=0;  4;(«J)+t,=0. 


FYom  the  conditions  on  the  dual  variables,  we  have  that  the  policy  will  apply  when 

The  optimal  policy  is  given  by  and  ^ such  that 

4;(«f)-4;(«n=c 

REGION  7 : The  analysis  here  is  similar  to  region  6.  The  optimal  policy  is  given  by 


and 


This  policy  is  optimal  for  starting  stocks  such  that 

«»® >**  (Pt-e)  ; >*J(pi  -e)  +•? (pO • 
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REGION  8:  Here,  i.e.,  the  optimel  policy  is  to  sUy  put. 

The  Tsrious  optimal  policy  regions  correspond  to  conditions  on  the  starting  stocks.  We  can 
represent  these  conditions  on  the  s-plane  so  as  to  give  the  optimal  poliqr  as  a function  of  the  starting 
stocks.  This  is  done  in  Figure  3,  and  this  diagram  is  seen  to  be  the  same  as  that  obtained  by  Gross 
[4].  We  have  thus  succeeded  in  recovering  his 'results. 


»i*(p2-el  *1 


Fiouaa  3.  The  two^oeation  ease:  optimal  policies  for  different  starting  stock  conditions. 

A NUMERICAL  EXAMPLE 

This  example  is  a slight  modification  of  the  one  described  by  Gross  in  Ref.  (4).  The  system 
here  consists  of  two  locations,  only  one  of  which  can  be  supplied  exogenously  at  a cost  of  11.60/unit. 
The  other  location  can  dispose  of  excess  stock  at  Sl/unit.  The  unit  shortage  holding  and  shortage 
costs  are  S2  and  88,  respectively,  for  both  the  locations,  and  costs  of  transshipment  between  locations 
are  ll/unit  in  either  direction.  Both  locations  experience  uniformly  distributed  exogenous  demands. 
The  supply  location  (1)  has  uniform  demand  over  1400-2000  units,  and  location  (2)  demand  is 
uniform  over  700-1300  units.  The  LP  dual  constraints' are 

»i  ^1.0 

— ^1.0 
»i— ^1.0 
-Si  ^1.0 
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The  corresponding  feasible  region  is  sketched  in  Figure  4(a).  For  the  uniform  distribution,  and 
•*  are  easily  calculated  for  any  given  shadow  prices  vj  and  «-].  The  nonlinear  estimation  problems 
are  simple  and  reduce  to 

«? (»i) =2360- 120  T,  2180<«J<2600 
**(»,)  = 1180-  60  T,  1120  <«?<  1240. 

For  the  constraints  corresponding  to  pure  transshipment  policies,  the  equations  of  the  lines  repre* 
senting  optimal  stock  levels  can  be  written  as 

2Sj  “"Sj— 120 
2«,— «,=  — 120 

These  equations  turn  out  to  be  linear  because  of  the  uniform  demand  distribution  assumption. 
The  optimal  policy  is  sketched  in  Figure  4(b) 


Fiouaa  4.  The  two-loeatioa  case:  s numerical  example. 
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THE  FORM  OF  THE  POUCY 


We  can  think  of  the  “form”  of  the  policy  described  above  in  terms  of  a “static"  set  of  points 
(Region  8)  in  which  the  optimal  policy  is  to  stay  put.  In  addition,  the  “boundary"  points  of  this 
set  are  also  optimal  stock  targets  for  starting  stocks  outside  the  set.  These  starting  stocks  can  be 
visualized  as  “cones"  attached  to  the  boundary  of  the  “static"  set  by  the  vertex;  for  starting 
stock  levels  in  a given  cone,  the  optimal  policy  is  to  move  to  the  vertex  of  the  cone. 

This  kind  of  intuitive  description  of  the  form  of  the  optimal  policy  in  fact  applies  to  a more 
general  problem  formulation  allowing  an  arbitrary  constraint  set  [6].  However,  a complete  de- 
scription of  the  optimal  policy  is  another  matter. 

While  the  exact  method  used  above  allowed  us  to  specify  optimal  policies  completely,  its 
practicability  diminishes  rapidly  as  the  number  of  locations  increases.  For  one  thing,  geometric 
intuition  is  of  little  help  beyond  the  case  already  discussed,  and  for  another,  the  number  of  distinct 
policies  to  be  considered  becomes  very  large  as  the  dimensionality  of  the  problem  increases.  (There 
are  37  policies  in  the  three-location  case).  It  would  clearly  be  useful  to  have  a method  of  computing 
the  optimal  solution  for  a given  starting  stock  position  without  having  to  know  the  whole  solution. 
Gross  [4]  has  suggested  the  use  of  gradient  search  techniques  involving  n’  variables.  Skeith  [11] 
has  used  a general  NLP  code  to  solve  a multilocation  example.  Presumably,  more  efficient  search 
techniques  could  be  devised  using  the  structural  properties  of  the  problem  as  revealed  in  the  anal- 
yses above. 

A COMPUTATIONAL  METHOD 

We  present  below  a brief  description  of  a column  generation  (generalized  programming) 
technique,  which  is  an  immediate  consequence  of  the  problem  decomposition  used  in  equations 
(6)  and  (7). 

The  subproblems  are  newsboy  problems  corresponding  to  each  location,  which  generates 
proposals  for  the  master  problem.  The  proposals  are  in  the  form  of  columns  with  entries  correspond- 
ing to  a proposed  target  stock  level  for  a location  and  the  associated  shortage  and  holding  cost. 
These  columns  are  then  incorporated  into  the  master  problem,  which  is  a suitably  modified  ver- 
sion of  the  transportation  subproblem.  The  master  problem  then  selects  linear  combinations  of 
the  proposal  columns.  Since  the  holding  and  shortage  cost  functions  ^<(8|)  are  convex,  the  linear 
combination  of  costs  from  the  columns  will  overestimate  the  true  costs  involved.  This  essentially 
ensures  that  successive  generation  of  columns  will  lead  to  belier  approximations  of  the  cost  func- 
tions ^|(•)• 

As  columns  are  generated,  the  linear  combination  of  adjacent  proposals  will  lead  to  an  inner 
linear  approximation  of  these  cost  functions.  Since  the  master  problem  is  one  of  minimization,  it 
will  automatically  choose  adjacent  proposals  when  forming  linear  combinations.  This  procedure 
is  shown  in  Figure  5.  The  column  generation  procedure  can  be  described  as  follows. 


Initially,  we  will  generate  n columns  corresponding  to  t|y^  starting  stocks  at  the  n locations. 
This  will  guarantee  the  existence  of  a feasible  solution  to  the  master  problem.  The  master  problem 
at  the  iteration  is  of  the  form 
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subject  to 


Min  ^ 60*0+^  2>A+^  S i=l, 


lz«l,  |0<)|Xu| 


3=1, 


ti  •■ 

^<1  ^Ui  ^rt^O- 


.,n 

.,n 


We  are  assuming  here  for  the  sake  of  simplicity  that  one  column  is  generated  for  each  subproblem 
at  each  iteration,  rt  and  Mt  are  the  shadow  prices  corresponding  to  the  constraints  as  shown.  With 
these  shadow  prices,  we  solve  n nonlinear  subproblems  of  the  form 

Min  Mi;  Si>0; 

>1 

(The  negative  sign  on  r<  is  due  to  a sign  change  on  the  constraint.)  Of  course,  the  constant  mi  can 
be  dropped  from  the  objective  function.  These  are  easy  to  solve,  and  the  solutions  are  then  passed 
up  to  the  master  problem  as  columns  of  the  form 


Cl.  (i+I) 

0 

S|.(/+l) 

0 

0 

1 

0 

The  master  is  updated  to  include  these  columns,  and  then  solved  again.  The  procedure  is 
repeated  until  optimality  is  reached  or  until  the  solution  is  thought  to  be  close  enough  to  optimality. 
Issues  of  establishing  optimality,  computing  upper  bounds  on  the  optimal  solution,  and  conver- 
gence of  the  algorithm  will  not  be  discussed  here.  A good  exposition  may  be  found  in  Lasdon  [101, 
Suffice  it  to  say  that  the  column  generation  algorithm  will  converge  for  any  convex  program. 

There  are  several  refinements  possible  in  this  procedure  in  the  selection  and  incorporation 
of  columns.  For  example,  we  may  want  to  immediately  select  columns  that  are  likely  to  be  im- 
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FIGURE  fi.  Inner  linear  approximation  of  a subproblem  cost  function  by  successive  columns. 

portant,  corresponding  to  tjrpical  policies  arising  from  extreme  points  of  the  dual.  Examples  of 
these  are  sT(p<)  and  «*(?>— Since  the  master  problem  will  only  use  two  columns  at  a time, 
it  may  be  possible  to  throw  out  some  of  the  columns  generated  after  a couple  of  iterations.  It  is 
also  straightforward  to  generalize  the  problem  by  adding  constraints  (as  on  capacities).  These 
are  simply  introduced  in  the  master  LP  as  in  the  following  example. 

A ^-LOCATION  EXAMPLE* 

Skeith  [11]  describes  a numerical  solution,  obtained  by  using  an  available  NLP  code,  to  a 
five-location  example.  His  example  is  used  here  to  illustrate  the  effectiveness  of  the  column  genera- 
tion technique.  Skeith’s  formulation  differs  slightly  from  ours  in  that  a credit  is  allowed  for  stock 
carried  over  into  the  next  period.  However,  this  essentially  amounts  to  a reduction  in  the  cost  of 
overstocking  by  the  amount  of  credit  allowed.  Transshipment  is  allowed  between  the  warehouses 
in  addition  to  ordering  from  a central  supply  point.  The  centrally  available  stock  is  limited  to 
100  units.  The  demand  at  the  i’th  location  has  an  exponential  distribution  with  mean  9 <:/<({<)  = 
(lldt)  exp  { — The  parameters  of  the  problem  are  listed  in  Tables  1 and  2. 

The  master  problem  in  this  case  is  exactly  as  described  above,  with  the  addition  of  the  con- 
straint on  the  total  stock  ordered: 

^ O,<100. 

« 

The  subproblems  used  to  generate  columns  for  the  master  problem  are  simplified  by  the  exponen- 
tial form  of  the  demand  distributions.  Suppose  that  at  iteration  f,  the  master  problem  yields 


*Thi8  example  was  included  on  the  suggestion  of  the  referee. 
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TABLE  1.  Five-Location  Example:  Cost  Parameters,  Initial  Inventory  Levels  and  Mean  Demands 


Location 

i 

(doSars) 

Co, 

(dollars) 

Cu, 

(dollars) 

(units) 

9i 

(units) 

1 

2.0 

2.0 

10.0 

400 

400 

2 

1.0 

3.  5 

8.0 

450 

200 

3 

2.0 

2.0 

12.0 

625 

500 

4 

3.0 

0.  5 

9.0 

500 

300 

5 

2.0 

3.0 

13.0 

150 

200 

TABLE  2.  Five-Location  Example:  Transshipment  Costs  {Dollars) 


To 

Location 

From 

1 

2 

3 

4 

5 

1 

1.0 

3.0 

1.5 

2.0 

2 

1.0 

3.0 

2.0 

Location  3 

3.0 

3.0 

2.0 

4 

1.  5 

3.0 

3.0 

--- 

1.5 

5 

2.0 

2.  0 

2.0 

1.  5 

... 

shadow  prices  t<‘.  Then  it  can  be  shown  that  the  (<-f-l)th  stocking  proposal  for  the  i’th  location  is 
given  by 

*<•  Jii  [(^Oi — ‘Si)l{Cui'\-Co^]- 

The  expected  cost  of  underage  and  overe^e  at  the  i’th  location  is  given  for  the  exponential  case  by 

exp  {—Stle,)—et)-\-C»,e,  exp  {—sSt)- 
For  stock  levels  as  calculated  above,  this  reduces  to 

Cu  (+i)=C'o('S<,  1+1 — 

The  starting  set  of  stocking  proposals  corresponded  to  “pure  ordering”  and  “no  change”  for  each 
location.  The  latter  set  ensured  feasibility  at  the  first  iteration.  The  optimal  solution  was  reached 
in  just  three  iterations  and  the  final  solution  was  (nonzero  variables  only) 

0, =1.381,  0, =25.065 

z„=154.518,  zjj=57.565. 


The  optimal  stock  levels  at  the  five  locations  were  sf=664.518,  8j=237.9l7,  8j=626.381,  8?  = 
600.00,  and  85=232.63.  The  total  cost  of  the  solution  was  $6653.  This  solution  differs  somewhat 
from  that  published  by  Skeith,  and  it  is  marginally  better  (by  $1  or  so). 


i 
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The  master  LP  problems  were  solved  on  an  existing  interactive  LP  package  called  BIGLP, 
I written  by  Linus  Schrage.  A small  (20-line)  BASIC  program  was  written  to  generate  columns 

I according  to  the  formulae  above,  and  columns  were  entered  interactively.  Computational  comparisons 

are  not  possible  since  Skeith’s  paper  did  not  describe  computational  experience.  However,  in  our 
i,  case,  only  three  LP’s  had  to  be  solved,  the  largest  of  which  had  11  constraints  and  43  variables, 

i 

SUMMARY  AND  CONCLUSIONS 

; The  approach  used  here  is  essentially  simpler  than  that  used  by  Gross.  The  key  notion  is 

J that  the  set  of  points  (Region  8)  where  the  optimal  policy  is  to  stay  put  is  obtained  from  the  dual 

feasible  region  to  the  LP  subproblem.  That  is  to  say,  given  a marginal  cost  of  supply,  the  optimal 
stock  levels  can  be  determined.  It  can  be  easily  verified  that  in  the  one  location  case,  the  solution 
■ reduces  to  the  so-called  “critical  number”  policy. 

i A rigorous  treatment  of  the  problem  is  given  by  Rarmarkar  [6],  and  the  approach  is  extended 

i to  a general  convex  stochastic  programming  formulation.  Multiperiod  and  infinite  horizon  prob- 

' lems,  including  the  lost-sales  case,  are  also  discussed  in  Refs.  [5],  [6],  and  [8]. 

Special  cases  of  two  location  problems  can  be  dealt  with  very  simply,  and  the  optimal  poli- 
cies for  all  starting  conditions  exhibited.  These,  together  with  other  computational  methods  for 
one-period  problems,  are  discussed  in  Ref.  [6]. 
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ABSTRACT 

Using  Markov  renewal  theory,  we  derive  analytic  expressions  for  the  expected 
average  cost  associated  with  (s,  S)  policies  for  a continuous  review  inventory 
model  with  a compound  Poisson  demand  process  and  stochastic  lead  time,  under  the 
(restrictive)  assumption  that  only  one  order  can  be  outstanding. 


1.  A CONTINUOUS  REVIEW  INVENTORY  MODEL. 

The  demand  process  is  of  the  compound  Poisson  type;  i.e.,  the  probability  of  a cumulative 
demand  A;  in  a time  period  of  length  t is  given  by 

,-xi  „(i) 


where  X^O  and  Vt”  is  the  f-fold  convolution  of  the  demand-size  distribution  at  k;  with 

Po=0,  p»>0  ifc=l,  2,  . . .,  denotes,  of  course,  the  probability  that  once  a request  is  made  on  the 
inventory  system,  k items  are  demanded,  t 

The  ordering  process  is  also  assumed  to  be  stochastic.  When  an  order  is  placed,  the  probability 
that  the  order  is  delivered  within  a time  period  of  length  t is  given  by  the  lead-time  distribution 

*Thia  research  is  an  outgrowth  of  the  project  “Sequential  Decision  Making”  of  the  Deoartment  of  Applied 
Economics,  K.U.L. 

tThe  assumption  that  vs>0  for  all  k>  1 is  made  for  analytical  convenience. 
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function  0(t),  which  is  assumed  to  be  continuous  and  Riemann  integrable  and  to  have  a finite 
mean.  (Note  that  G(,t)  is  assumed  to  be  independent  of  the  order  size.) 

As  to  the  cost  structure  we  consider 

(i)  an  ordering  cost  of  the  form  Co+<k>k,  when  k items  are  ordered, 

and  per  unit  time: 

(ii)  an  inventory  carrying  cost  of  Ci  per  unit 

(iii)  a backorder  cost  of  cj  per  unit. 

Furthermore,  it  will  be  assumed  that  only  inventory  policies  of  the  («,  S)  type  will  be  utilized. 
The  optimization  criterion  will  be  the  expected  average  cost  criterion. 

So,  we  want  to  solve  the  following  optimization  problem : Find  a (a,  S)  policy  that  minimizes 
the  expected  average  cost  over  an  infinite  planning  horizon  of  the  inventory  problem  with  the 
demand  and  ordering  process  and  the  cost  structure  described  above.* 

To  achieve  this  task,  we  will  employ  techniques  of  Markov  renewal  theory  (see  Cinlar  [1],  [2], 
and  Schellhaas  [5])  to  obtain  an  analytic  expression  for  the  expected  average  cost  associated  with  a 
given  (a,  S)  policy  (Sections  2 and  3)  to  reduce  the  optimization  problem  to  a straightforward 
search  procedure. 

The  proposed  development  will,  unfortunately,  necessitate  the  following  restrictive : 

ASSUMPTION : No  orders  can  overlap. 

In  other  words,  if  an  order  is  outstanding,  the  next  order  can  only  be  placed  after  the  arrival  of 
the  outstanding  order.  It  should  be  stressed  that  the  tools  of  Markov  renewal  theory  can  be 
employed  without  this  assumption;  however,  the  difficulties  arise  when  one  attempts  to  derive 
analytical  expressions  for  the  cost  rate  associated  with  (s,  S)  policies,  f 

In  order  to  put  the  present  paper  in  perspective,  we  will  compare  the  model  used  here  with 
those  of  Gross,  Harris,  a^d  Lechner  (GHL)  [4],  Gross  and  Harris  (GH)  [3],  and  Schellhaas  (S)  (6], 
the  latter  three  models  being  most  closely  related  to  the  present  one.  The  four  models  (including 
ours  (DK))  can  be  classified  according  to  the  assumptions  made  about  the  demand  process,  the 
leadtime,  the  inventory  policy,  and  the  cost  structure. 

As  to  the  demand  process,  (DK)  is  most  general;  in  this  model,  it  is  assumed  that  demand 
can  be  characterized  by  a compound  Poisson  process.  In  (GHL)  it  is  assumed  that  the  demand 
is  a compound  Poisson  process  with  the  restriction,  however,  that  the  demand-size  distribution 
assigns  probability  zero  to  demands  larger  than  two.  Finally,  in  (GH)  and  (S),  a simple  Poisson 
process  is  assumed  to  be  a proper  representation  of  the  demand  process. 

As  to  the  assumptions  made  about  the  leadtime,  the  four  models  split  into  two  groups:  on 
the  one  hand,  (GHL)  and  (GH),  where  a “state-dependent”  lead-time  distribution,  i.e.,  where 
the  lead-time  distribution  is  a function  of  the  number  of  outstanding  orders,  is  allowed  for,  and 
oathe  other  hand,  (S)  and  (DK),  where  the  lead-time  distribution  is  a fixed  (arbitrary)  distribution 
(with  finite  mean).  To  facilitate  theoretical  developments  with  state-dependent  lead  times,  distri- 
bution of  the  exponential  type  are  considered. 


•The  subsequent  analysis  can  be  adjusted  to  incorporate  the  expected  discounted  cost  criterion. 
fA  manageable  problem  results  when  {S  — l,  S)  policies  are  employed. 


j 
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With  respect  to  the  inventory  policies  used,  (GH)  and  (S)  use  («,  S)  policies.  In  (DK),  a («,  S) 
policy  is  utilized  provided  no  orders  can  overlap,*  whereas  a (5—1,  S)  policy  is  followed  in  the 
(GH)  model. 

Finally,  we  note  that  (GH),  (S),  and  (DK)  are  essentially  equivalent  in  cost  structure,  the 
distinguishing  feature  from  (GHL)  being  the  presence  of  a fixed  order  cost.  Indeed,  the  two  typos 
of  “imderage  costs”  considered  in  (GHL),  (GH),  and  (S)  can  easily  be  incorporated  in  (DK) 
but  this  was  not  considered  in  (DK)  for  brevity  of  exposition. 

In  summary,  wo  note  that  none  of  the  models  properly  generalizes  the  others;  the  “no  overlap” 
assumption  prevents  the  present  model  from  being  the  most  general.  We  may  also  note  that  the 
second  author  is  presently  developing  models  where  demand  processes  belong  to  the  class  of 
semi-Markov  processes,  and  hence,  constitute  a generalization  of  the  above  models  with  respect 
to  demand. 

2.  THE  INVENTORY  PROCESS  AS  A REGENERATIVE  STOCHASTIC  PROCESS. 

In  this  section  wo  introduce  some  results  from  Markov  renewal  theory  in  the  spirit  of  Cinlar 
[1]  to  apply  to  the  inventory  model  described  above. 

NOTATION 

Z:  integers 

N:  nonnegative  integers 
R:  reals 

R+:  nonnegative  real  numbers 

Consider  a probability  space  (0,  ff,  P)  and  the  random  variables 
X,  : Q^Z, 

T,  : tt-*R^, 

such  that  7(1=0,  for  all  ntN  and,  finally,  7*— »<»  almost  surely.  Then 

DEFINITION  1 : { (J1l„7,)  ; ntN}  forms  a Markov  renewal  process  with  state  space  Z if 

P{X,+i=j,  7,+i— 7,^f/X(,,  . . .,  X,;  7o,  . . .,  7,) 

=P(JY,+i= j,  7,+i— 7,)  for  all  jtZ  and  <*[0,  »). 

DEFINITION  2 : Let  {.^(O ; tfR+ ) be  any  stochastic  process  such  that  X(t)  =Xn  for  t=  7,. 
Then,  {X(() ; <tR+}  is  said  to  be  a regenerative  process  with  respect  to  a Markov  renewal  process 
{(X„  Tn);ntN}iI 

P{X{t)=jlXo X,Tt 7.;  X(r)  for  all  rs£7,) 

=P{X{t)=jlX„  7.)  for  all  t^T,,ntN. 

Now  wo  show  how  the  inventory  problem  of  Section  1 can  be  described  in  terminology  of 
Markov  renewal  theory.  To  do  so,  define  f 

(1)  as  a random  variable  taking  values  in  t/=  {5,  5—1,  . . .},  denoting  the  inventory 
level  at  time  period  7,;  we  assume  {X^,  Tt)  — ik,  0)  with  krSiS. 


*Thi8,  in  fact,  rules  out  "state-dependent”  lead  time  distributions. 

tFrom  now  on  we  will  assume  that  a particular  (s,  S)  policy  is  given,  in  order  to  avoid  cumbersome  notation. 
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(2)  T,,  1,  as  the  arrival  time  of  the  quantity  ordered  at  7’,_i  if  Xn-i=i,  with  i^s  and  as 

the  next  order  point  if  X,_i=i  with  8<ii^S. 

Furthermore,  in  order  to  characterize  the  distribution  of  the  length  of  a regeneration  interval, 
denote  by  the  probability  that  the  next  regeneration  point  occurs  in  a time  less  than  or  equal 
to  for  anyj’tt/. 

LEMMA  1 : For  a,ny  jtJ, 

10(0.  ifj=£8. 

* 

AO.  k.  t),  if  i>«. 

with 

AO.  k. 

PROOF:  If  quantity  S—j  is  ordered;  hence  Bf{t)  = G(.t).  Wheny>«,  observe  that 

P(D.=j-k). 


where  D , denotes  the  cumulative  demand  in  a time  period  of  length  t. 

PROPOSITION  1 : If  and  T„  are  defined  by  (1)  and  (2),  then  | {Xn,  T,) ; is  a Markov 
renewal  process. 

PROOF  ; For  any  ntN,  X^=i,  ktJ,  consider 
CASE  (i)  : i>8 


P(,^n+i—k,  Tm+i — T„^t/Xi„  . . .,  X„ — i;  To,  Ti,  . . .,  T„) 

(0,  if  fc>s, 

~ir  P^^-^r=k/Xo,  . . .,  X„=v,  To,  Tu  . . .,  TMBM, 

if  k^8, 
if  ifc>«, 

P(i-D,=klX,=i,  T„)dBM, 
if  k^8. 

CASE  (ii): 

P{X„^,=k,  T„^,-T,^tlXo,  . . .,  X„=i-,  To,  Tu  . . .,  T„) 


r 

=<  rt 


= £piS-D,=k/Xo X„=i;  To,  T, T.)dG(r) 

=£p(.S-Dr=k)dG{T). 


The  following  proposition  is  then  immediate. 

PROPOSITION  2;  Let  X(t)  be  a random  variable  indicating  the  inventory  level  at  time  t. 
Then  |X(t);  t>0|  is  a regenerative  process  with  respect  to  the  Markov  renewal  process  defined 

by  (1)  - (2). 
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Now  that  it  is  established  that  T*);  neA^|  is  a Markov  renewal  process,  it  is,  of  course, 
well-known  that  {X,;  ntN]  is  a Markov  chain.  To  characterize  the  transition  structure  of  this 
imbedded  Markov  chain,  let 

Pij=P(X,+i=jlX^=i) 


so  that,  for  any  pair  i,  jtJ 


0, 

if  i>s,  j>8, 

(3) 

P»= 

Mi,  j,  ■r)dB,{T), 

if  iys^j, 

In  fact. 

jjMS.j,  r)dG{r), 

if  i:Ss. 

LEMMA  2;  The  Markov  chain  {.y,;  n*iV)  is  ergodic. 

PROOF : Aperiodicity  and  irreducibility  follow  from  (3).  To  show  positive  recurrence,  we  first 
exhibit  an  explicit  solution  to  the  following  system: 


(4) 

(6) 

(6) 


with  pii  as  in  (3). 

For  all  i ^ we  rewrite  (4)  as 

(7) 

with  ai=pji  for  all  j^s. 

For  « <i^ S,  we  obtain  for  (4) 

(8) 

Define 

(9) 

with 

and 


n,=^p„ny,  itJ, 
n<^o,  itJ, 

s $ 

n<=  X)  p,iTi,+ai  S R) 

} — « 


n:= 


n< — 2 Ry- 


\aiK,  if  « <1^5, 
biK,  ifi^s, 
a 


The  {n/}  defined  in  (9)  solve  the  system  (5)  - (8),  establishing  the  lemma. 

Lemma  2 will  not  only  prove  to  be  useful  in  subsequent  theoretical  developments,  but  also 
will  provide  us  with  an  explicit  solution  (see  (9))  for  the  stationary  distribution  of  the  Markov 
chain  {.AT.I. 
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In  order  to  be  able  to  invoke  some  results  of  Schellhaas  [5],  we  need  some  definitions. 
Let 

u)=P(X(,t)=k/(X(o)=j,  Tx=u)  for 

and 

^,,it)=P{X(t)=k,  t<T,<a>IX{0)=i), 


so  that 
Hence, 

(10) 


for  fc>i>8  or 

A (j,  k,  t)dB, («),  for  a<k^j, 

A {j,  k,t)d6  (u)  for  k^j ^s. 


The  expected  length  of  a regeneration  interval  with  initial  state  j is  denoted  by  m,,  i.e., 
(11)  m,-£  tdBjit),  jtJ. 


If  PAt)=P(Xit)=klX(.0)^j),  then 
THEOREM  1:  For  any  ktJ 


(12) 


Pt=)imPit(t)  = 


^ Hi  ^i^(t)di 


PROOF;  To  apply  a result  of  Schellhaas  (Ref.  [5],  Eorrolar  1.1.,  p.  12),  it  suflBces  to  observe 
that  the  imbedded  Markov  chain  is  ergodic  (Lemma  2),  that  B^{t)  is  continuous,  and  that  ^>t(0  is 
Riemann-integrable. 

If  Vj{t)  denotes  the  expected  cost  associated  with  a particular  («,  S)  policy  during  a time 
interval  (0,  <]  if  X(0)=j,  our  interest  lies  in  the  limiting  behavior  of  r‘V^((t).  In  fact,  we  have  the 
following: 

THEOREM  2:  The  limit  of  exists  for  alljeJ;  in  fact. 


with 

9 = ^ + 9'-\-9^ 

and 

f 

(13) 

(14) 

p‘=c.giPT, 

(18) 

/=c.  ± jPI, 

n. 


CONTINTJOUS  RBVrEW  INVENTORY  MODEL  683 

where  the  n/s  are  the  solution  of  (4)-(6),  the  m/s  are  defined  by  (11),  the  PJ’s  are  ^ven  in  (12), 
and 

m'=J'  tdO(t). 


REMARK:  The  fact  that  g=g'‘+g^+9^  reflects  the  additive  cost  structure;  the  avera^  cost 
associated  with  the  ordering  process,  the  inventory  costs,  and  the  backorder  costs  are  added  up 
to  obtain  the  overall  average  costs.  Note  the  intuitive  interpretation  of  (13)-(15).  In  (13),  for 
instance. 


is  the  expected  inventory  level,  given  that  levels  below  « are  considered. 

PROOF:  The  theorem  is  an  easy  consequence  of  a result  of  Schellhaas  (Ref.  [5],  Korrolar  2.1., 
p.  21)  which,  in  turn,  can  be  established  by  standard  renewal  theoretic  techniques,  cf.  Cinlar  [2]. 
Schellhaas  proved  that,  if  the  embedded  Markov  chain  is  ergodic,  and  for  a cost  structure  which 
has  a finite  expected  value  on  finite  intervals. 


(16) 


g=\im 


YM 

t 


^n,m/ 


where  py  is  the  expected  cost  in  a regeneration  interval  with  initial  state  j. 

This  result  will  be  applied  to  ordering,  inventory  carrying,  and  backorder  costs.  For  the 
ordering  costs  we  have 

0,  if  i>«, 

2 (Co4-Co(-S— ifi^«. 


Hence, 


So, 


|0,  if  j>8, 

Wo+ColS— i), 


f- 


^ ny(Co+Co(S— j)) 

-~a  a 

S >nmy+  S Dym' 


and  (13)  is  established. 

Observe  that  for  the  case  of  the  inventory  carrying  costs. 


fO,  ifi:S0, 


Substitution  in  (16)  and  changing  the  order  of  summation  gives,  then,  (14).  The  same  argument 
applies  to  the  backorder  costs. 

Theorems  1 and  2 give  us  an  explicit  method  to  compute  the  expected  average  cost  of  a par- 
ticular («,  S)  policy;  however,  we  will  see  in  the  next  section  how  these  results  can  be  simplified 
from  a computational  point  of  view. 
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3.  SOME  ANALYTICAL  RESULTS. 


In  this  section,  we  exploit  the  properties  of  the  compound  Poisson  process  in  order  to  obtain 
manageable  analytical  expressions  for  the  “cost  rates”  (13)-(15). 

In  view  of  (9),  it  is  clear  that  an  expression  for  the  stationary  distribution  can  be  obtained 
if  the  transition  probabilities  (see  (3))  can  be  explicitly  computed.  So 
LEMMA  3;  For  any  given  («,  S)  policy, 


0, 

if  S^i>a,  j>8, 

t+i-J 

if  t=«+l,  j^«. 

t-i  i-t-l  k „(m)  1 

(/+«-!)!/ 
h 1!2‘+*  V 

2nJ 

if  <S'3:i^«+l>j, 

if  i^S*. 

PROOF:  Consider  the  case  where  S>i>a-i-l  and  j<8.  Then 


so  that 


r.,-/;  A(i,  J.  t)d  .r ). 

/;  ^(i.  >.  'g  g < />(i.  i,  <>(«“ 


Substituting  for  A (i,  j,  t)  and  exploiting  the  properties  of  the  gamma  density  gives 

Hence,  the  result  for  is  established.  The  other  cases  are  trivial. 

We  also  have,  in  view  of  Lemma  1 and  (11), 

LEMMA  4:  For  any  itJ, 

(J*  td6{t),  if  i^s. 


mj=  ->  if  i=«-f-i, 

K‘+'§  .S"*")’  *">•+>• 

Furthermore,  (and  this  is  useful  to  compute  the  values  of  PJ  (see  Theorem  1)) 
LEMMA  5:  For  any  itJ 


0, 

i-$-i  k i-k 


(-*  i-k  Cm 

S “§  Jo  ^ ^ 


i>«,  ifc>i  ork^a, 
if  •<*«», 
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(For  simplicity  oj*  ■ 1.) 

The  proof  of  Lemma  4 and  5 is  similar  to  that  of  Lemma  3. 

The  results  can  be  used  to  evaluate  (13)  — (15).  Unfortunately,  two  infinite  series 


are  still  to  be  evaluated.  In  practical  cases,  these  series  would  have  to  be  approximated;  this  is,  of 
course,  numerically  speaking,  not  an  unsiumountable  task. 

Further  simplifications  arise  if  a special  form  of  the  lead-time  distribution  is  assumed.  For 
instance,  if  fl'(0  = 1 — then  for  allj^s 


s~i 


mX' 


*‘“^'‘“jS(X  + M)'+‘®‘"‘ 


and  setting 


e= 


X + M 


n,= 


S-l 


X-f-M 


if  S^i^. 


where  K is  defined  as  in  (9).  Of  course,  the  expressions  for  po,  m,,  and  Pt  simplify  accordingly. 

The  results  for  the  case  when  the  demand  occurs  according  to  a Poisson  process  can  be  obtained 
by  simplifying  all  previous  expressions  to  accoimt  for  the  fact  that  in  this  case  01*^  =0  whenever 
t^n. 

If  we  now  consider  the  overall  expected  average  cost  as  a function  of  the  policy  used,  i.e., 
g($,  S),  then  it  follows  that  p(-,  5)  is  a unimodal  function,  and  if  we  denote  «*(S)  as 


p(#*(S),  S)-min  p(f,  5), 

»<s 


the  optimal  solution  is  found  by  minimising  p(«*(S),  S)  over  S.  To  achieve  this  task,  simple  search 
procedures  can  be  used. 
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ABSTRACT 

Two  issues  of  frequent  importance  in  new  product  development  are  product 
improvement  and  reliability  testing.  A question  often  faced  by  the  developer  is: 

Should  the  product  be  distributed  in  its  present  state,  or  should  it  be  improved 
further  and/or  tested  before  distribution?  A more  useful  statement  of  the  question 
might  be;  What  levels  of  investment  in  further  improvement  and  testing  are 
economically  permissible?  Products  for  which  this  question  is  relevant  may  vary 
widely  in  type  and  intended  use.  This  paper  presents  a model  for  determining  these 
levels  for  one  such  product — an  equipment  modification  procedure.  The  model 
presented  makes  use  of  present  value  analysis  to  compare  cost  streams  and  of 
Bayesian  statistics  to  relate  the  costs  to  various  outcomes  under  conditions  of 
uncertainty.  The  model  is  applied  to  an  actual  military  problem  and  a method  is 
described  for  examining  the  sensitivity  of  the  results  to  changes  in  the  prior  prob- 
abilities and  discount  rate.  j 


INTRODUCTION 

It  is  hardly  profound  to  suggest  that  the  more  a decision  maker  knows  about  a process  and 

the  greater  the  sensitivity  of  the  process  to  the  resources  under  his  control,  the  better  and  more 

I influential  his  decisions  are  likely  to  be.  But  when  the  additional  information  and  capability  must 

^ be  secured  at  a price,  and  when  both  come  wrapped  in  a cloak  of  uncertainty,  a more  relevant 

question  might  be:  “How  much  should  the  decisionmaker  pay  for  the  additional  information  and 
capability?” 

The  relevance  of  this  question  is  perhaps  most  lucid  in  terms  of  a business  firm  developing  a 
product  for  sale.  The  typical  firm  would  normally  undertake  product  improvement  if  the  cost  of 
the  improvement  program  was  less  than  the  expected  increase  in  revenue.  Similarly,  a firm  would 
normally  undertake  a product  testing  program  if  it  anticipated  that  the  information  to  be  gained 
from  the  testing  would  lead  to  additional  marginal  revenues  expected  to  be  in  excess  of  the  margi- 
nal costs  of  testing. 

•Colonel,  U.S.  Army. 
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Thus,  two  distinct  but  related  issues  are  involved — product  improvement  and  product  test- 
ing. The  issue  of  product  improvement,  although  it  might  well  involve  subjective  prior  estimates 
of  expected  product  marketability,  is  rather  straightforward  and  presents  no  significant  compu- 
tational complexities.  The  second  issue,  however — that  of  testing  or,  more  generally,  information 
gathering — is  far  more  interesting  and  challenging  since  it  involves  an  effort  to  upgrade  decision 
quality.  This  effort  is  based  not  only  on  product  improvement  but  also  on  acquiring  additional 
information  about  the  level  of  improvement  that  the  program  is  expected  to  achieve.  In  other 
words,  the  first  issue  deals  only  with  upgrading  the  product,  whereas  the  second  issue  deals  with 
upgrading  the  decision  to  market  as  well  as  product  improvement. 

It  is  also  important  to  note  that  both  issues  involve  investing  money  now  to  save  money  (or 
increase  revenues)  later.  In  other  words,  the  typical  firm  that  undertakes  product  improvement 
and  testing  is,  in  effect,  delaying  the  marketing  date  (and  the  consequent  revenues)  in  anticipa- 
tion of  higher  revenues  later  from  the  improved  product.  By  comparing  the  anticipated  increases 
in  revenue  to  the  costs  of  product  improvement  and/or  testing,  the  firm  is  able  to  ascertain  if 
these  efforts  should  be  undertaken  and  to  estimate  the  levels  of  expenditure  permissible  for  both. 

The  decision  procedure  suggested  by  this  discussion  is  relevant  with  respect  to  many  decision 
opportunities.  This  paper  describes  the  application  to  a decision  opportunity  within  the  military 
involving  the  development  and  testing  of  an  equipment  modification  procedure.  Three  broad  ques- 
tions are  sequentially  addressed : 

1.  Should  the  modification  procedure  be  applied  in  any  form? 

2.  Should  the  modification  procedure  be  applied  without  further  development  and/or 
testing,  or  should  a program  be  undertaken  to  improve  and/or  test  the  procedure  before 
field  application? 

3.  How  much  is  it  permissible  to  spend  on  further  development  and  testing  of  the 
modification  procedure? 

The  decision  procedure  described  in  this  paper  is  referred  to  as  the  Bayesian  Approach  because 
of  its  use  of  the  Bayes’  Probability  Theorem  and  the  Bayes’  Decision  Procedure  (see,  e.g.,  Raiffa 
[3]).  Bayes’  Theorem  is  used  to  develop  posterior  probabilities  from  subjectively  determined  prior 
probabilities  and  subsequent  testing  information;  expected  value  statistics  based  on  these  prob- 
abilities are  used  to  make  expected  cost  comparisons  among  the  competing  alternatives.  The  paper 
concludes  by  presenting  the  process  in  decision  tree  format  and  by  examining  the  sensitivity  of  the 
results  to  changes  in  certain  model  parameters. 

THE  PROBLEM  EXAMINED 

During  the  late  1960’s,  the  U.S.  military  in  South  Vietnam  began  experiencing  a high  failure 
rate  on  a certain  subassembly  of  one  of  its  high-density  electric  power  generators.  Because  the  high 
failure  rate  had  become  quite  costly  in  terms  of  replacement  costs  and  generator  downtime,  it 
became  necessary  for  the  responsible  logistical  command  to  analyze  the  problem  and  consider 
various  ways  in  which  the  situation  could  be  corrected.  The  procedure  described  in  this  paper 
approximates  the  analysis  that  was  conducted  to  select  an  appropriate  corrective  alternative. 

"The  problem  is  presented  in  detail  to  permit  the  solution  procedure  to  be  developed  in  a rigorous 
manner.  'The  reader  should  recognize,  however,  that  both  the  problem  and  the  procedure  are  very 
general  in  character  and  that  the  procedure  is  applicable  to  a wide  variety  of  problems  related  to 
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mformatiuu  gathering  and  sequential  decision  making  under  conditions  of  uncertainty.  It  is  hoped 
that  the  reader  will  keep  this  generality  in  mind  while  reviewing  the  specific  application  described. 

THE  PRIOR  PROBABILITIES  OF  FAILURE  RATE  REDUCTION 

The  initial  stages  of  the  decision-making  process  followed  a rather  typical  pattern.  For  example, 
the  engineers  reviewed  the  technical  aspects  of  the  problem  and  foimd  that  a modification  procedure 
could  be  applied  to  the  generators  that  would  result  in  reductions  in  the  failure  rate.  Although  the 
modification  procedure  could  be  applied  without  further  development,  the  procedure  was  somewhat 
makeshift  in  its  initial  state,  and  the  developing  engineers  felt  that  the  likelihood  of  achieving 
significant  failure  rate  reduction  was  relatively  low.  When  asked  to  quantify  their  feelings,  the 
engineers  responded  that  they  could  not  be  too  precise  about  the  level  of  failure  rate  reduction  that 
would  be  achieved,  but  could  provide  estimates  of  the  reduction  probabilities  at  four  aggregate 
levels.  The  probability  estimates  provided  by  the  engineers  are  listed  in  Table  1. 

The  developing  engineers  further  indicated  that  if  they  were  permitted  to  improve  the 
modification  procedure  before  it  was  applied,  the  probabilities  associated  with  the  four  levels  of 
failure  rate  reduction  could  be  substantially  increased.  Specifically,  the  engineers  pointed  out 
that  if  they  were  given  six  more  months  in  which  to  improve  the  procedure,  the  probabilities 
associated  with  the  four  states  of  failure  rate  reduction  could  be  increased  to  the  levels  indicated 
in  Table  2. 


TABLE  1.  Prior  Probabilities  of  Failure  Rale  Reduction  WUhtnU  Further  Modification  Development 


LEVEL  OF  FAILURE 

PROBABILITY  ESTIMATE 

STATE 

RATE  REDUCTION 

OF  ACHIEVING 
REDUCTION  LEVEL 

(Sd 

(X) 

P(Si) 

Si 

60% 

0.  10 

s, 

40% 

20% 

0.50 

s. 

0% 

0.  20 

TABLE  2.  Prior  Probabilities  of  Failure  Rate  Reduction  With  Further  Modification  Development 


STATE 

LEVEL  OF  FAILURE  RATE 

PROBABILITY  ESTIMATE 

REDUCTION 

OF  ACHIEVING 
REDUCTION  LEVEL 

(Si) 

iX) 

P{S.) 

Si 

60% 

0.  25 

s. 

40% 

0.50 

St 

20% 

0.20 

St 

0% 

0.  05 
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The  generator  failures  were  of  such  a nature  that  personnel  safety  was  not  significantly 
endangered.  Consequently,  the  analysis  was  based  purely  on  cost — or,  more  correctly,  “expected” 
cost,  in  order  to  recognize  the  future  nature  of  such  costs  and  the  uncertainty  associated  with 
each  level  of  failure  rate  reduction. 

In  establishing  the  relevant  costs,  it  was  assumed  that  the  military  intended  to  replace  all  of 
the  generators  of  this  type  with  a newer,  more  efficient  model  3M  years  from  the  decision  date.  For 
this  reason,  a 3)i-year  cost  stream  was  used  in  determining  the  total  cost  of  system  repair.  Further- 
more, it  was  estimated  that  the  cost  of  modifying  all  of  the  generators  currently  in  the  inventory 
would  be  $10  million  for  both  the  unimproved  and  the  improved  versions  of  the  modification 
procedure.  Additionally,  the  costs  of  modification  improvement  and  testing,  if  undertaken  either 
collectively  or  separately,  would  also  be  incurred  during  the  first  six  months  following  the  decision 
date;  however,  these  costs  were  treated  as  model  outputs  and,  hence,  as  unknowns  in  the  initial 
cost  estimates. 

For  ease  of  computation,  it  was  assumed  that  if  the  modification  was  applied  without  further 
development,  the  modification  cost  of  $10  million  would  be  incurred  immediately  and,  thus,  the 
modified  system  repair  costs  would  be  incurred  over  the  full  3K  years  remaining  in  the  generator’s 
life  cycle.  Conversely,  if  the  modification  was  improved  before  application,  it  was  assumed  that  it 
would  be  applied  at  the  end  of  six  months  from  the  decision  date.  Thus,  in  the  latter  case,  present 
system  repair  costs  would  be  incurred  for  the  first  six  months  and  modified  system  repair  costs 
would  be  incurred  for  the  final  three  years  of  the  generator  life  cycle.  Finally,  if  the  modification 
procedure  was  tested  without  improvement  before  the  application  decision,  it  was  estimated  that 
such  testing  would  require  two  months  and  that  the  modification,  if  applied,  would  be  applied  at 
the  end  of  two  months  from  the  decision  date.  The  latter  alternative  and  the  relevant  costs  are 
discussed  more  fully  in  a later  section  of  the  paper. 

With  these  cost  data,  the  various  cost  streams  were  then  compared  using  a present  value 
analysis.  A discount  rate  of  10  percent  was  used  since  the  problem  addressed  involved  an  existing 
program  considered  to  be  covered  by  the  provisions  of  Office  of  Management  and  Budget  Circular 
A-94  [4]  which  was  interpreted  as  requiring  an  annual  rate  of  10  percent.  The  equation  used  to 
compute  the  present  value  {PV)  of  the  cost  streams  was 

PF=  Co-b  <7,(1 Cid 

where  C'<=cost  associated  with  period  i {Co  represents  initial  costs), 
and  i=monthly  discount  rate.* 


*The  monthly  discount  rate,  t,  was  selected  such  that  the  present  value  of  a cost  stream  discounted  monthly 
at  ,'%  is  the  same  as  the  present  value  of  the  cost  stream  discounted  annually  at  r%.  That  is,  t=  (l  + r)''**— 1.0. 

Thus,  for  r=10%.  f =0.007974. 
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The  relevant  cost  statistics  are  listed  in  Table  3. 


TABLE  3.  Syttem  Co$Ufor  All  PoasMe  Improvement  LeveU 
(I  MILLION) 


ACT 

STATE 

INITIAL 

PERIODIC  REPAIR  COST! 

PRESENT 

(5.) 

COST 

1 

2 

3 

4 

VALUE 

Modify  without  further 

Sr 

10 

4 

8 

8 

8 

32.  78 

development 

St 

10 

6 

12 

12 

12 

44. 17 

S, 

10 

8 

16 

16 

16 

56.  67 

St 

10 

10 

20 

20 

20 

66.96 

Modify  with  further 

Sr 

0* 

20t 

8 

8 

8 

38.04 

development 

St 

0* 

20t 

12 

12 

12 

47.  52 

St 

0* 

20t 

16 

16 

16 

57.  01 

St 

0* 

20t 

20 

20 

20 

66.  49 

No  Modify 

St 

0 

10 

20 

20 

20 

56.96 

*Co«t  of  modification  development  and/or  testing  not  included.  The  permissible  expenditures  for  these  efforts 
are  model  outputs. 

flncludes  six-month  repair  cost  without  modification  plus  110  million  for  cost  of  modification  application. 
{First  period  is  six  months  in  length;  others  are  12  months. 


MODIFY  OF  NOT  MODIFY? 

The  first  question  examined  in  the  analysis  was  whether  or  not  to  imdertake  modification 
at  all.  The  decision  was  based  on  a comparison  of  expected  system  costs  over  the  S^-year  period 
wiUi  and  without  system  modification.  The  expected  system  cost  without  modifying  the  generators 
was  $56.96  million,  which  may  be  read  directly  from  the  last  row  in  Table  3.  The  expected  system 
cost  with  modification  but  without  further  development  of  the  modification  procedure  was  com- 
puted using  the  equation 

£«7*(=FV,P(5.)  +PV^m  +PV^P(Sn)  +PVJ>{S,) , 

where  system  cost  with  modification  but  without  further  development  of  the  modification 

procedure, 

P(i9<) withe  prior  probability  associated  with  state  i without  further  development  (i=il, 
2,  3,  4)  (Table  1), 

and  PFiaethe  present  value  of  the  cost  stream  associated  with  state  i without  further  de- 
velopment (i=l,  2,  3,  4)  (Table  3). 

Thus, 

^«7»)-(0.10)  (32.78) +(0.20)  (44.17) 

+ (0.80)  (58.67) + (0.20)  (66.96) 

>3.28+8.83+27.79+ 13.39 
>$53.29  million. 
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Since  the  expected  system  cost  without  modification  was  $56.96  million,  a savings  of  $3.67million 
(i.e.,  $56.96— $53.29)  was  potentially  realizable  by  applying  the  modification,  even  without  further 
development  and  testing.  Thus,  based  on  the  prior  probabilities  given,  the  decision  was  properly 
to  apply  the  modification. 

FURTHER  DEVELOPMENT  OR  NO  FURTHER  DEVELOPMENT? 

The  next  decision  opportunity  considered  was  whether  to  apply  the  modification  without 
further  development  or  to  undertake  a program  to  improve  the  modification  procedure  before 
field  application.  This  decision  was  made  by  comparing  expected  system  costs  if  the  modification 
was  applied  without  improvement  and  testing,  E(C*) , to  expected  system  costs  if  the  modification  was 
improved  but  not  tested  before  application,  E{C**).  The  latter  expected  cost  was  computed  using 
the  equation 

E{C**) =py.p(s,)  -1-PF,P(S,) +PVJ>(S,) +py*P(S4) , 

where  C'**= system  cost  with  modification  and  with  further  development  of  the  modification 

procedure, 

P(/Si)=the  prior  probability  associated  with  state  i with  further  development  (i=l,  2,  3, 
4)  (Table  2), 

and  PFi=the  present  value  of  the  cost  stream  associated  with  state  i with  further  develop- 
ment (i=l,  2,  3,  4)  (Table  3). 

Thus, 

£((7**)  = (0.25)  (38.04)  4- (0.50)  (47.52) 

-|-(0.20)  (57.01)4- (0.05)  (66.49) 

=9.514-23.764-11.404-3.33 
=$48.00  million. 

Since  the  expected  system  cost  using  an  unimproved  modification  procedure  was  $53.29  million,  it 
was  clearly  economical  to  undertake  the  improvement  program,  as  the  potential  existed  for  ad- 
ditional savings  in  the  amount  of  $5.29  million  (i.e.,  $53.29— $48.00). 

TESTING  OR  NO  TESTING? 

It  would  have  been  quite  possible,  of  course,  to  undertake  an  improvement  program  without 
first  subjecting  the  subsystem  to  failure  testing  with  the  improved  modification  applied.  Accordingly, 
the  third  major  question  considered  was:  How  much  money  should  be  devoted  to  failure  (or 
reliability)  testing  in  conjunction  with  the  improvement  program  before  making  the  application 
decision? 

Clearly,  the  most  that  could  be  acquired  from  such  testing  was  perfect  information  concerning 
the  level  of  modification  development  achievable.  From  Table  3 it  may  be  deduced  that  if  perfect 
information  had  been  available  after  development  and  before  the  application  decision,  the  decision- 
maker would  have  chosen  to  modify  the  generators  only  if  the  level  of  failure  rate  reduction  was 
either  state  S\  (60%)  or  state  Sa  (40%) . If  the  level  of  failure  rate  reduction  was  either  state  S|  (20%) 
or  state  St  (0%),  the  decisionmaker  would  have  chosen  not  to  apply  the  modification,  since  with  the 
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latter  two  states  the  system  cost  with  modification  would  have  exceeded  the  system  cost  without 
modification. 

The  value  of  perfect  information,  then,  in  this  problem,  may  be  viewed  as  the  expected  savings 
that  would  have  accrued  to  the  decisionmaker  by  choosing  not  to  modify  if  the  true  state  was 
either  St  or  S4.  In  terms  of  cost,  the  value  of  perfect  information  may  be  expressed  as  the  difference 
between  the  expected  cost  if  the  true  states  were  known  with  certainty  and  the  expected  cost  when 
the  only  information  about  the  states  was  the  set  of  prior  probabilities  set  forth  in  Table  2.  The 
latter  expected  cost,  of  course,  was  E{C**),  or  $48.00  million,  as  previously  computed.  The  expected 
system  cost  corresponding  to  perfect  information,  denoted  E{C**\Pr),  was  computed  as  follows: 

E{C**\PD  = (0.25)  (38.04)  + (0.50)  (47.52)  + 

(0.20)  (56.96)  + (0.05)  (56.96) 

=9.51+23.76+11.39+2.85 
=$47.51  million. 

Therefore,  the  expected  value  of  perfect  information,  denoted  E{PI),  was 

E{Pr)  =E{C**) -E{C** \PT) 

=48.00-47.51 
=$0.49  million. 

This,  then,  was  the  greatest  amount  that  could  be  spent  on  failure  testing,  even  if  the  testing  could 
have  produced  perfect  information.  In  reality,  of  course,  no  testing  procedure  yields  perfect  informa- 
tion. Therefore,  the  maximum  level  of  expenditure  allowable  for  testing  was  obviously  somewhat 
lower  than  this  upper  bond,  and,  in  fact,  depended  on  the  reliability  of  the  test  procedure  which 
will  be  discussed  in  the  paragraphs  to  follow. 

THE  TEST  PROCEDURE. 

The  failure  test  to  which  the  subsystem  (with  the  improved  modification  applied)  was  to  be 
subjected  involved  a testing  procedure  that  had  been  used  many  times  in  the  past  and  on  which  an 
acceptable  set  of  reliability  data  had  been  accumulated.  Experience  with  the  test  had  indicated 
that  the  test  results  could  be  divided  into  the  four  groups  listed  in  Table  4.* 


TABLE  4.  Relationship  Between  Test  Outcome  and  Level  of  Failure  Rate  Reduction 


TEST  OUTCOME  {T,) 

REDUCTION  IN  FAILURE  RATE 

r, 

60%(  + ) 

Tt 

40-60% 

Tt 

20-40% 

Tt 

0-20% 

•The  analysis  considered  the  test  procedure  at  a single  fixed  level.  That  is,  the  test  could  only  be  conducted  or 
not  conducted;  it  could  not  be  conduct^  at  varying  levels.  An  interesting  extension  to  the  model  would  be  to  relax 
this  assumption  and  permit  alternative  testing  levels. 
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It  had  al«o  been  found,  by  comparing  past  test  outcomes  to  the  subsequent  levels  of  product 
improvement,  that  once  the  levels  of  product  improvement  were  known,  the  probabilities  associated 
with  the  previous  test  results  were  distributed  over  the  full  range  of  possible  test  outcomes  and 
could  be  estimated  from  the  historical  data.  That  is,  if  we  define 

P(7V|5<)=probability  that  the  previous  test  result  was  T}  if  the  true  state  (level  of  improvement) 
turned  out  to  be  St, 

the  resulting  conditional  probabilities  (test  outcome  likelihoods)  for  the  four  states  and  four  test 
outcomes  were  derivable  from  past  test  usage.  A complete  listing  of  the  likelihoods  for  the  test  is 
set  forth  in  Table  5. 

TABLE  6.  Test  Outcome  Likelihoods 


P(Tilfi^i) 


STATE 

TEST  OUTCOMES 

r, 

Tt 

r. 

Tt 

5, 

0.900 

0. 060 

0.025 

0.025 

St 

0. 050 

0.  876 

0. 060 

0.025 

s. 

0. 026 

0.  050 

0. 876 

0.060 

St 

0.026 

0.025 

0.060 

0.900 

Next,  the  probabilities  listed  in  Tables  2 and  5 were  used  to  compute  the  joint  probabilities 
that  each  combination  of  test  outcome  and  failure  rate  reduction  level  would  occur.  These  proba- 
bilities were  developed  from  the  following  expression  for  the  test  outcome  likelihoods: 

n /a.  I o N P (Both  Si  and  r,  occur) 

P(Si  occurs) 

PCSiClTj). 

P(8,) 

which,  when  rearranged,  yielded  the  following  equation  for  the  joint  probabilities: 

P(S,m^P{T^Si)P{Si). 

Thus,  the  joint  probabilities  were  found  by  taking  the  matrix  product  of  Tables  2 and  6.  A complete 
summary  of  these  joint  probabilities  is  given  in  Table  6. 

Table  6 also  reflects,  along  the  bottom  row,  the  marginal  probahility  associated  with  each 
test  outcome.  These  probabilities  were  computed  using  the  equation 

P(r,)=P(r,is,)P(Si)+P(r,is,)P(s,)-i-P(r,ls,)P(S,)-i-P(r,iS4)P(S4), 


which,  in  effect,  is  a summation  of  the  column  entries  in  Table  6. 
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TABLE  6.  Joint  Probabilities  That  Both  the  Test  Outcomes  and  the  True  States  of  Modijieation 

Improvement  Occur  P{Si  0 Tj) 


Si 

0. 2250 

0.  0125 

0.0063 

0.  0062 

0.  2500 

s, 

0.  0250 

0.  4375 

0.  0250 

0.  0125 

0.5000 

s. 

0.  0050 

0.  0100 

0. 1750 

0.0100 

0.2000 

S4 

0.  0013 

0.  0012 

0.  0025 

0.  0450 

0.  0500 

P(P/) 

0.  2563 

0.  4612 

0.  2088 

0.  0737 

1.000 

The  final  step  in  computing  the  relevant  probabilities  was  to  find  the  posterior  probability 
associated  with  each  level  of  improvement,  conditioned  on  the  test  outcomes.  The  conditional 
relationship  used  to  find  these  posterior  probabilities  was 

PiSi\T,)=^^i^- 

A complete  summary  of  these  posterior  probabilities  is  given  in  Table  7. 

TABLE  7.  Posterior  ProbahUities  of  Achieving  Staled  Levels  of  Modification  Improvement  Conditioned 

on  Test  Outcomes 


P{S,\T,) 


Si 

0.  8779 

0. 0271 

0.  0302 

0.0841 

s, 

0.  0975 

0.  9486 

0.  1197 

0.  1696 

S, 

0.  0195 

0.  0217 

0.  8381 

0.  1357 

S4 

0.  0051 

0.0026 

0.  0120 

0.  6106 

EXPECTED  SYSTEM  COSTS  CONDITIONED  ON  TEST  RESULTS. 

The  next  step  in  the  decision  procedure  was  to  use  the  probabilities  in  Table  7 to  compute  the 
expected  3)i-year  system  cost  associated  with  each  test  outcome.  These  costs  were  computed  using 
the  equation 

E(c**iT,)  =p  (Sr\T,)P  (s,ir,)py,+ 

P(S,\T^)PV»+PiS,\Tj)PV,. 
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Summing  these  expected  costs  for  all  four  levels  of  improvement,  then,  gave  the  expected  cost 
associated  with  each  test  outcome.  That  b, 

TO  = (0.  8779) (38.  04)  + (0. 0975)  (47.  52)  + 

(0.  0195)  (57.  01)  + (0.  0051)  (66.  49) 

=33.40+4.  63+1.  11+0.34 
=$39. 48  million, 

E{C**\TO  = {0.  0271)(38.  04)  + (0.  9486)(47.  52)  + 

(0.  0217)  (57.  01)  + (0.  0026)  (66.  49) 

= 1.  03+45.  08+1.  24+0.  17 
=$47.  52  million, 

E(C**\ TO  = (0.  0302)  (38.  04)  + (0.  1 197)  (47.  52)  + 

(0.  8381)  (57.  01  + (0.  0120)  (66. 49) 

= 1.  15+5.  69+47.  78+0.  80 
=$55. 42  million, 

and 

E{C**\T0  = (.0.  0841) (38.  04)  + (0.  1696) (47.  52)  + 

(0.  1357)  (57.  01)  + (0.  6106)  (66.  49) 

=3.  20+8.  06+7.  74+40.  60 
=$59.  60  million. 

The  expected  system  cost  associated  with  each  test  outcome  was  then  used,  in  conjunction 
with  the  marginal  probability  of  each  test  outcome,  to  compute  the  expected  system  cost,  given 
only  that  testing  was  conducted.  This  cost  was  computed  using  the  equation 

E(C**lTEST)=P(r,)  E((7**|r, )+?(?,)  E(C**|r,)+ 

P(Tt)  E{C**\Tt)+P{T0  E(C**\T0. 

Summing  these  expected  costs  for  all  four  test  outcomes,  then,  gave  the  expected  system  cost 
conditioned  only  on  the  test  being  conducted.  That  is, 

£:(C**|TEST)  = (0.  2563)  (39.  48)  + (0.  4612)  (47.  52)  + 

(0.  2088)  (55.  42)  + (0.  0737)  (56.  96) 

= 10. 12+21.  92+11.  57  +4.  20 
=$47.  81  million. 

Note  that  if  the  test  result  turned  out  to  be  T4,  the  decisionmaker  would  choose  not  to  apply  the 
modification,  since  the  system  cost  with  modification  ($59.  60  mil)  would  exceed  the  system  cost 
without  modification  ($56.  96 mil).  This,  in  essence,  constituted  the  return  from  the  testing  effort. 

PERMISSIBLE  EXPENDITURES  ON  FURTHER  DEVELOPMENT  AND  TESTING. 

As  indicated  earlier  in  the  paper,  the  major  question  faced  by  the  decisionmaker  was:  How 
much  was  it  economically  permissible  to  spend  on  further  development  and  testing?  The  answer  to 
thb  question  was  computed  in  two  parts. 


BATESIAN  MILITART  mVEOTMENTS  EVALUATION 


697 


First,  the  maximum  level  that  could  be  spent  on  further  development  only  (without  testing), 
denoted  Ac,  was  computed  as  the  difference  between  the  expected  system  cost  if  the  modification 
was  applied  without  improvement,  E{jO*),  and  the  expected  system  cost  if  the  modification  was 
improved  but  not  tested  before  application,  E{C**).  That  is, 

ko-=E{,C*)-E{C**) 

=53.29-48.00 
=$5.29  million. 

Next,  the  maximum  level  that  could  be  spent  on  testing  undertaken  in  conjuction  with  further 
development,  denoted  At,  was  computed  as  the  difference  between  expected  system  cost  with 
improvement  but  without  testing  and  expected  system  cost  with  both  improvement  and  testing. 
That  is, 

At=E{C**) -E{C**  1 TEST) 

=48.00-47.81 
= $0.19  million. 

Therefore,  if  testing  was  imdertaken  in  conjunction  with  an  improvement  program,  it  was  eco- 
nomical to  spend  an  amount  on  testing  not  to  exceed  $190,000. 


TESTING  WITHOUT  FURTHER  DEVELOPMENT. 

It  was  also  desired  to  evaluate  the  alternative  of  testing  without  further  development.  For 
this  alternative  it  was  assumed  that  two  months  would  be  needed  for  the  test  to  be  completed  and 
the  modification  procedure  applied.  Therefore,  the  system  costs  were  estimated  to  be  distributed 
over  the  3)(-year  period  as  indicated  in  Table  8. 


TABLE  8.  Sy$tem  Co»t$  for  Tetting  Without  Further  Development  ($  MILLION) 


STATE 

(S.) 

INITIAL 

COST 

PERIODIC  REPAIR  COSTS  J 

PRESENT 

VALUE 

(FVi) 

1 

2 

3 

4 

5 

s. 

0 • 

13.  33  t 

2.  67 

8 

8 

8 

34.  63 

s. 

13.  33  t 

4.00 

12 

12 

12 

45.  39 

S, 

0 * 

13.  33  t 

5.  33 

16 

16 

16 

56. 14 

S, 

0 • 

13. 33  t 

6.67 

20 

20 

20 

66.90 

*Co8t  of  testing  not  included.  The  permissible  expenditure  for  this  effort  is  a model  output, 
tlncludes  two-month  repair  cost  with  modification  plus  $10  million  for  cost  of  modification  application. 
{First  period  is  two  months;  second  period  is  four  months;  others  are  12  months. 


The  expected  system  cost  for  this  alternative  for  each  test  outcome  was  given  by  the  equation 

E((7*ir,)=p(s,ir,)pr,-i-p(s,ir,)pv,-i- 

P(S,|r,)PV,-i-P(S4|r,)PF4. 
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The  actual  computations  of  the  posterior  probabilities,  P(iS<|7^),  for  this  alternative  are  not 
shown;  however,  they  were  based  on  the  prior  probabilities  listed  in  Table  1 rather  than  those 
listed  in  Table  2.  Furthermore,  the  discounted  costs,  PF<,  used  were  those  listed  in  Table  8.  Thus, 


as  before. 


EiC*\ Ti)  = (0.7660)  (34.63)  + (0.0851)  (45.39)  + 
(0. 1064)  (56. 14)  + (0.0426)  (66.90) 
=26.53+3.86+5.97+2.85 
=$39.21  million. 


E{C*\ T»)  = (0.0238)  (34.63)  + (0.8333)  (45.39)  + 
(0.1 190)  (56. 14)  + (0.0238)  (66.90) 
=0.83+37.82+6.68+1.  59 
=$46.92  million. 


E(C*\ Tt)  = (0.0054)  (34.63)  + (0.0217)  (45.39)  + 
(0.951 1)  (56.14)  + (0.0217)  (66.90) 
=0.19+0.99+53.39+1.45 
=$56.02  million, 

and 

E{C*\ T,)  = (0.01 18)  (34.63)  + (0.0235)  (45.39)  + 
(0.1176)(56.14)  + (0.8471)(66.90) 
=0.41  + 1.07+6.60+56.67 
=$64.75  million. 


I The  expected  system  cost  associated  with  each  test  outcome  was  then  used  to  compute  the 

I expected  system  cost  given  only  that  testing  was  conducted.  That  is, 

£:(C*  ITEST)  =P  (T.)  £((7*  I r,) +P  (T,)  £((7*  I r,) + 
P{T,)E{C*\T^)+P(T,)E{C*\T,), 

where,  again,  the  marginal  probabilities  associated  with  the  test  outcomes  were  computed  based 
on  the  prior  probabilities  listed  in  Table  1.  Thus, 

^(^♦ITEST)  = (0.1175)  (39.21)  + (0.2100)  (46.92)  + 

(0.4600)  (56.02)  + (0.2125)  (56.96) 
=4.61+9.85+25.77+12.11 
=$52.34  million. 

Finally,  the  maximum  level  that  could  be  invested  in  testing  without  further  development. 
At,  was  computed  as  follows: 

Ar=E(C*)  -E(C7*|TEST) 

=53.29-52.34 
=$0.95  million. 

Therefore,  even  without  further  development,  it  was  economical  to  spend  an  amount  on  testing 
not  to  exceed  $950,000  merely  to  reduce  the  uncertainty  surrounding  the  mean  prior  estimate  of 
failure  rate  reduction. 


i 


1 

i 


I 


J 

I 

1 


i 
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DECISION  TREE 

The  foregoing  procedure  may  also  be  portrayed  in  decision  tree  format.  The  decision  tree, 
complete  with  expected  costs  and  the  probabilities  associated  with  those  costs,  is  presented  in 
Figure  1. 


DECISION 


TEST 

OUTCOME 


- Modify 

a2  - Not  modify 


Further  Dev 
& Testing 

$47.81 


4 .0737 


Further  Dev  w/o  Testing 
$48.00 


No  Dev  or  Testing 
$53.29 

Testing  w/o  Further  Dev 
$52. 34 


DECISION 


‘1  $39.48 


^2 

t 

$56.96 

^1 

$47. 52 

k. 

$56.96 

^1 

$55.42 

k. 

$56.96 

^1 

$59.60 

k. 

$56.96 

^1 

$48.00 

k. 

$56.96 

^1 

$53.29 

k 

$56.96 

STATE 
.8779 
.0975 
.0195 
.0051 
1.0 
.0271 
.9486 
.0217 
. 0026 
1.0 
.0302 
.1197 
. 8381 
.0120 
1.0 
.0341 
. 1696 
.1357 
. 6106 
1.0 
.25 
.50 
.20 
.05 
1.0 


COST 

_$38.04 

.$47.52 

$57.01 

$66.49 

$56.96 

$38.04 

.$47.52 

.$57.01 

.$66.49 

.$56.96 

$38.04 

.$47.52 

.$57.01 

.$66.49 

.$56.96 

.$38.04 

.$47.52 

$57.01 

.$66.49 

$56.96 

.$38.04 

$47.52 

.$57.01 

$66.49 

$56.96 

.$32.7- 

$44.17 

.$55.57 

$66.96 

'56.96 


FIGURE  1.  Bayes’  Decision  Procedure  in  decision  tree  format  (costs  in  $ Millions). 
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I 


TEST 


DECISION 


(S> 


Testing  w/o 
Further  Dev. 


$52.34 


OUTCOME 

DECISION 

STATE 

.7660 

®1  $39.21 

®2  .0851 

®3  .1064 

^1  .1175 

®4  .0426 

®2  $56.96 

®4  1.0 

®1  .0238 

^1  $46.92 

®2  .8333 

®3  .1190 

"^2  .2100 

®2  $56.96 

®4  .0238 

CO 

o 

®1  .0054 

^1  $56.02 

®2  .0217 

®3  .9511 

"^3  .4600 

®4  .0217 

*2  $56.96 

®4  1.0 

®1  .0118 

®2  .0235 

®1  $64.75 

“3  .1176 

"^4  .2125 

®4  .8471 

®2  $56.96 

®4  1.0 

COST 

$34.63 
$45.39 
$56.14 
$66.90 
$56.96 
$34.63 
$45.39 
$56.14 
$66.90 
$56.96 
$34.63 
$45.39 
$56.14 
$66.90 
$56.96 
$34.63 
$45. 39 
$56.14 
$66.90 
$56.96 


FIGURE  . Bayes’  Decision  Procedure  in  decision  tree  format  (costs  in  $ Millions)— Continued. 


SENSITIVITY  ANALYSIS. 


The  Bayes’  Decision  Procedure  used  in  this  decision  situation  was  criticized,  as  is  often  the 
case,  due  to  the  subjective  manner  in  which  the  prior  probabilities  were  derived.  Since  the  subjec- 
tive prior  probabilities  could  not  be  empirically  verified,  the  view  was  expressed  that  the  entire 
procedure  was  based  on  an  intuitive  foundation  and  was,  therefore,  somewhat  suspect  as  a useful 
decision-making  technique.  To  examine  this  criticism,  a sensitivity  analysis  was  conducted  in  which 
the  prior  probabilities  were  varied  and  the  effects  on  key  model  outputs  were  observed.  Since  the 
prior  probabilities  in  this  case  were  discrete,  it  was  necessary  to  assume  various  sets  of  priors  rather 
than  to  merely  change  a continuous  parameter.  Furthermore,  since  a present  value  approach  was 
used  to  determine  the  relevant  costs,  the  sensitivity  of  key  model  outputs  to  the  discount  rate  was 
also  examined.  This  section  discusses  the  structure  of  the  sensitivity  analysis  that  was  conducted 
and  summarizes  the  significant  results. 

To  examine  the  sensitivity  of  the  model  results  to  the  priors,  two  parameters  of  the  priors 
were  used — the  arithmetic  mean  and  the  variance  about  the  mean.  The  prior  mean  was  a measure 
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of  the  tendency  of  the  group  of  estimators  toward  a central  value  and,  in  a sense,  represented  the 
collective  opinion  of  the  group  as  to  the  true  value  of  the  parameter.  Similarly,  the  variance  in- 
dicated the  spread  of  the  priors  about  the  mean  and  was  a measure  of  the  uncertainty  in  the  group 
concemine  the  true  value  of  the  parameter  that  the  group  members  were  trying  to  estimate.  More 
specifically,  the  mean  prior  estimate  could  logically  be  said  to  reflect  the  collective  knowledge  of 
the  members  of  the  group  concerning  the  physical  system  imder  consideration;  the  variance  could 
be  said  to  reflect  the  collective  strength  of  their  convictions  about  the  true  parameter  value. 


TABLE  9.  Data  Seta  for  Senaitwity  Analyaia 


STATE 

SET 

SET 

SET 

SET 

SET 

m 

#1 

#2 

#3 

#4 

#5 

CONSTANT  VARIANCE  (260)  (%) » 

Si 

0.55 

s. 

0.  35 

s. 

0.20 

0.  05 

s. 

0. 15 

0. 

0.  05 

0. 05 

MEAN 

24 

31 

39 

44 

48 

(%) 

CONSTANT  MEAN  (39)  (%) 

Si 

0. 15 

0.  20 

0. 25 

0.  25 

0.  30 

s. 

0.  70 

0.50 

0.  55 

St 

0. 10 

0. 20 

0.  10 

0. 15 

St 

0.  05 

0. 05 

0.  10 

0. 10 

VAR 

179 

219 

260 

299 

339 

(%)* 

i 


'i 

1 

.1 

j 


In  choosing  the  sets  of  prior  probabilities  to  examine,  we  selected  five  sets  with  a constant  ] 

mean  and  five  sets  with  constant  variance.  The  sets  of  prior  probabilities  used  are  set  forth  in  | 

Table  9.  The  two  groups  of  prior  probability  sets  had  one  set,  the  base  estimate,  in  common.  Thus, 
hy  solving  the  model  for  each  of  the  prior  probability  groups,  it  was  possible  to  separately  examine 

the  sensitivities  of  model  outputs  to  the  prior  mean,  holding  variance  constant,  and  to  prior  variance  , 

holding  the  mean  value  constant. 

The  first  outputs  examined  for  sensitivity  were  the  principal  model  outputs — permissible  i 

expenditure  level  on  modification  development,  Ao,  and  permissible  expenditure  level  on  testing  j 

undertaken  in  conjunction  with  modification  development,  Ar.  These  two  outputs  were  examined  | 

for  sensitivity  to  both  prior  mean  and  prior  variance  and  separately  to  the  discount  rate.  In  addi-  | 
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tion,  ihe  expected  system  costs  for  the  five  altemstives  considered  were  examined  for  sensitivity 
to  the  discount  rate. 

The  results  relating  Ad  to  prior  mean  and  variance  are  illustrated  in  Figure  2.  Two  observa- 
tions based  on  these  results  are  particularly  noteworthy. 

First,  Au  was  found  to  be  an  increasing  linear  function  of  the  prior  mean.  In  fact,  since  the 
prior  mean  could  be  viewed  as  representing  the  reliability  of  the  modification  procedure,  the  implica- 
tion was  that  an  increasing  linear  relationship  existed  between  reliability  and  the  level  of  expendi- 
ture permissible  for  modification  improvement.  Moreover,  an  examination  of  the  slope  of  the 
linear  relationship  revealed  that  a change  in  reliability  of  one  percent  would  cause  an  average 
change  of  $474,200  in  Ad.  Thus,  the  relationship  was  not  only  linear  but  quite  sensitive,  indicating 
that  further  efforts  to  improve  the  reliability  of  the  modification  procedure  might  well  pay  signifi- 
cant dividends.  Theae  results  are  analogous  to  those  reported  by  Brown  and  Perlman  [1]  in  their 
study  describing  a repair  parts  inventory  model  for  the  F-14  aircraft. 

Second,  Ad  was  found  to  be  totally  unrelated  to  the  prior  variance.  That  is,  the  level  of  uncer- 
tainty exhibited  in  the  prior  probabilities,  by  way  of  variance  about  the  mean,  had  nothing  at  all 
to  do  with  determining  the  level  of  permissible  expenditures  on  modification  development — ^which 


Permissible  Development  Expenditure  (A^) ($Mil) 


FIGURE  3.  Influence  of  prior  mean  and  variance  on  penniMible  expenditure  level  for  modification  development. 
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for  a prior  mean  of  39  percent,  was  a constant  $5.29  million.  That  is  to  say,  with  respect  to  Aa, 
it  was  the  collective  knowledge  of  the  group  of  engineers  estimating  the  priors  (exhibited  in  their 
tendency  toward  a central  value)  that  was  important,  rather  than  the  strength  of  the  group’s 
conviction  about  the  mean  value.  Althou^  it  is  difficult  to  completely  divorce  a discussion  of  means 
from  a discussion  of  variances,  the  relationships  exhibited  by  Fig.  2 would  seem  to  support  a general 
hypothesis  that  the  value  of  priors  in  decision-making  is  positively  correlated  to  the  professional 
competence  of  the  group  estimating  the  priors,  and  that  the  uncertainty  in  the  group  is  not  as 
important  as  bias  or  the  lack  of  objectivity  on  the  part  of  the  group  members. 

The  results  relating  Ar  to  prior  mean  and  variance  are  illustrated  in  Figure  3.  The  relation- 
ships exhibited  by  Fig.  3 were  perhaps  the  most  germane  of  the  sensitivity  analysis  since  they 
dealt  more  directly  with  the  principal  issue  of  the  analysis — that  of  acquiring  additional  informa- 
tion through  testing.  Two  major  observations  were  important  in  this  respect. 

First,  unlike  permissible  development  expenditures,  permissible  testing  expenditures  varied 
with  both  prior  mean  and  prior  variance.  As  prior  variance  increased,  holding  the  mean  constant, 
the  permissible  expenditures  on  testing  increased  at  an  increasing  rate.  This  implied  that  the 
higher  the  level  of  uncertainty  in  the  prior  probabilities,  the  greater  the  payoff  from  testing  was 
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Permissible  Testing  Expenditure  (A^) ($M11) 

FIGURE  3.  Influence  of  prior  mean  and  variance  on  permMble  expenditure  level  for  modification  teatinf . 
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likely  to  be.  On  the  other  hand,  as  the  prior  mean  increased,  holding  the  variance  constant,  the 
permissible  expenditures  on  testing  decreased  at  a decreasing  rate.  This  implied  that  the  more 
optimistic  the  prior  estimate  was  concerning  the  level  of  reduction  in  failure  rate  expected  to  be 
brought  about  by  modification  improvement,  the  smaller  the  expected  payoff  from  testing  was 
likely  to  be. 

Second,  a major  implication  of  Fig.  3 was  that  prior  estimates  should  conform  as  closely  as 
possible  to  the  true  results  being  estimated.  Stated  in  another  way,  it  was  important  that  the  prior 
estimates  represent  the  most  informed  judgment  available.  Despite  this  obvious  finding,  however, 
it  was  instructive  to  observe  that  regardless  of  the  accuracy  and  uncertainty  characterizing  the 
prior  estimates,  the  Bayes’  Decision  Procedure  tended  to  produce  a higher  quality  decision.  This 
latter  observation  was  deduced  from  the  result  that,  for  all  values  of  the  prior  mean  and  variance 
examined,  it  was  advisable  to  invest  in  testing  to  secure  additional  information  before  making 
a modification  decision. 

The  model  outputs,  Ao  and  Ar,  were  also  examined  for  sensitivity  to  the  disroimt  rate.  How- 
ever, the  only  significant  finding  in  this  respect  was  that  as  the  discount  rate  increased,  there  was 
a tendency  to  spend  less  on  product  improvement  and  more  on  product  testing.  With  respect 
to  modification  development,  this  result  merely  illustrated  the  well-known  fact  that  as  the  oppor- 
tunity cost  of  money  increases,  the  less  economical  it  becomes  to  invest  current  dollars  in  anticipa- 
tion of  future  savings.  With  respect  to  modification  testing,  the  result  emphasized  the  importance 
of  making  a higher  quality  decision  to  offset  the  lower  reliability. 

The  final  model  outputs  examined  for  sensitivity  were  the  expected  system  costs  for  each  of 
the  five  alternatives  considered.  These  costs  were  examined  for  sensitivity  to  the  discoimt  rate, 
and  the  results  are  illustrated  in  Figure  4.  Several  observations  from  this  part  of  the  analysis  were 
particularly  interesting. 

First,  the  alternative  of  both  improving  and  testing  the  modification  procedure  before  ap- 
plication was  found  to  be  dominant  for  all  discount  rates  examined.  That  is  to  say,  the  preference 
for  this  alternative  over  the  other  alternatives  considered  was  not  sensitive  to  the  discount  rate. 

Second,  with  respect  to  testing,  it  was  preferable  at  all  discount  rates,  to  undertake  product 
improvement  in  conjunction  with  testing,  but  testing  without  improvement  was  always  preferable 
to  applying  the  modification  with  neither.  In  other  words,  even  if  no  further  improvement  in  the 
modification  procedure  was  made,  it  was  still  economically  advisable  to  undertake  a testing  pro- 
gram. This  latter  result  is  consistent  with  a similar  finding  by  Brown  and  Rogers  [2]  in  their  ap- 
plication of  Bayes’  Theorem  to  the  problem  of  estimating  demand  for  the  F-14  aircraft. 

Finally,  for  all  discount  rates  below  about  28  percent,  it  was  found  advisable  to  apply  the 
modification  in  some  form.  However,  for  discount  rates  above  about  28  percent,  it  was  foimd  that 
if  modification  was  to  result  in  lower  expected  system  costs,  the  modification  procedure  had  to  be 
Mtber  improved  and/or  tested  before  application. 

I (»eM  Li  mopf 

. tlw  M»aiv«>*  t wa*  ' xwarv  to  'tddniiw  one  final  cnticism  of  the  Bayes’  Pro- 

• • '*•  ■ -1  ■ ••rtM'ilar  ■ and  one  whirh  fretpiently  surfaces  when  decision 

' — «-aie*e«  4«< » ervwrw  of  Ui*  procedure  suntMited,  as  they 
....  . iti*  •wstli  weld  be  higher — not  lower — 
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FIGURE  4.  Influence  of  annuel  discount  rate  on  discounted  expected  system  cost. 

net  system  cost.  The  rationale  for  this  contention  may  be  seen  in  the  following  discussion  related 
to  the  problem  addressed  in  this  paper: 

Recall  that  an  expected  failure  rate  reduction  level  of  24  percent  could  be  achieved  by  applying  the  modi- 
fication without  further  development,  and  that  further  development  was  expected  to  improve  this  reduction 
level  to  39  percent.  It  follows,  then,  that  the  permissible  investment  level  in  further  development  (Ad) 
of  $6.29  million  was  based  on  achieving  an  increase  in  the  mean  failure  rate  reduction  of  not  less  than  15 
percent.  If,  after  spending  the  $6.29  million,  it  had  been  found  that  a failure  rate  reduction  of  less  than  16 
percent  had  been  achieved,  the  conclusion  could  have  been  justifiably  drawn  that  it  would  have  been 
cheaper  to  have  applied  the  modification  without  further  development.  This  was  the  danger,  the  critics  j 

claimed,  in  basing  a technique  on  subjectively  determined  prior  probabilities  that  reflected  the  human 
characteristic  of  being  overly  optimistic. 

How  does  one  handle  this  apparent  anomaly?  In  a general  sense,  the  answer  to  this  question  con- 
stitutes one  of  the  key  requirements  in  making  the  Bayes’  Procedure  work.  In  this  particular 
case,  the  answer  was  that  the  amount  spent  on  development  could  not  be  permitted  to  exceed  the 
programmed  amount.  If  it  had,  the  critics  would  have  been  correct — the  procedme  would  have 
been  more  expensive.  Therefore,  it  was  necessary  that  the  funds  allocated  to  product  improvement 
be  spent  in  a manner  that  achieved  the  programmed  level  of  improvement  upon  which  the  fund 
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allocation  was  based.  Stated  more  blunthly,  it  would  have  been  necessary  to  terminate,  or  at  least 
revise  the  program,  as  soon  as  it  had  become  clear  that  the  programmed  level  of  improvement  would 
not  be  achieved.  Failure  to  do  this  would  not  have  implied  a deficiency  with  the  Bayes’  Procedure; 
it  would  have  implied  a deficiency  in  managing  its  implementation. 

SUMMARY. 

The  analysis  described  in  this  paper  has  illustrated  the  application  of  a very  useful  procedure 
for  addressing  a real-world  sequential  decision-making  opportunity  in  the  face  of  uncertainty. 
Although  the  problem  described  is  military  in  character,  the  procedure  is  equally  applicable  in 
other  decision  environment.<>.  Typically,  the  developer  of  a product  must  answer  two  questions: 
How  much  should  be  spent  to  develop  (or  improve)  the  product  further,  and  how  much  should 
be  spent  on  testing  (or  other  activity)  to  acquire  additional  information  about  the  product  before 
making  the  marketing  (or  distribution)  decision?  Treatment  of  these  core  questions  has  constituted 
the  essence  of  the  paper.  It  is  important  to  recognize,  however,  that  the  treatment  has  been  only 
partial,  since  possible  economies  in  the  specific  development  and  testing  programs  were  not  ad- 
dressed. That  is,  only  maximum  aUowahU  program  expenditures  were  derived — the  upper  bounds 
on  development  and  testing  costs. 

The  procedure  described  made  use  of  Bayes’  Theorem,  Bayes’  Decision  Procedure,  and  a 
present-value  analysis  in  the  estimation  of  future  cost  savings  from  current  alternative  decision 
opportunities.  Central  to  the  procedure  was  the  method  used  to  update  subjective  prior  probabilities 
based  on  later  information  received  from  testing  as  a means  of  facilitating  a posterior  expected 
value  analysis  of  system  costs.  The  procedure  began  with  a prior  probability  distribution  which 
captured,  insofar  as  possible,  all  of  the  knowledge  available  at  the  time  concerning  the  reliability 
of  the  proposed  modification  procedure,  and  then  depicted  the  decisionmaker  as  having  five  distinct 
alternatives. 

First,  he  could  choose  not  to  apply  the  modification  and  to  continue  to  sustain  the  present 
high  system  costs.  It  was  shown  that  this  alternative  was  not  economically  advisable,  since 
application  of  the  modification  even  without  further  improvement  would  result  in  a lower  expected 
system  cost. 

Second,  he  could  choose  to  apply  the  modification  without  further  improvement  or  testing. 
However,  based  on  the  prior  probabilities  given,  it  was  shown  that  this  alternative  could  also  be 
improved  on  by  developing  the  modification  procedure  further  or  by  testing  it  without  further 
development  before  application. 

Third,  he  could  choose  to  improve  the  modification  and  then  apply  it  without  first  subjecting 
it  to  a testing  procedure.  However,  it  was  shown  that  testing  would  give  information  to  the  decision- 
maker that  he  could  use  to  improve  the  quality  of  his  decision  and  further  reduce  the  expected 
system  cost. 

Fourth,  he  could  choose  to  test  the  modification  procedure  without  improvement  before 
application.  Although  this  alternative  was  preferable  to  the  alternatives  of  either  not  applying  the 
modification  or  applying  it  without  testing,  it  was  shown  that  this  alternative  could  also  be  improved 
on  by  developing  the  modification  procedure  further  before  application,  and  that  this  relationship 
held  for  all  annual  discount  rates  up  to  and  including  35  percent. 
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Finally,  the  fifth  alternative — that  of  improvement  and  testing  before  application — was  shown 
r to  be  the  most  economical  alternative  based  on  expected  system  cost  over  a Sl^year  period. 

'f  A comprehensive  sensitivity  analysis  was  then  conducted  to  examine  the  sensitivity  of  the 

I key  model  outputs  to  both  the  mean  and  variance  of  the  prior  probabilities  and  separately  to  the 

discount  rate.  Several  of  the  results  of  the  sensitivity  analysis  were  significant.  They  showed,  for 
' example,  that 

1.  The  permissible  amount  to  spend  on  further  development  of  the  modification  procedure 
was  a sensitive  increasing  linear  function  of  the  prior  mean  but  was  totally  unrelated  to  the  prior 
variance.  That  is  to  say,  with  respect  to  this  particular  output,  it  was  the  collective  knowledge  of 
the  group  of  engineers  estimating  the  priors  (exhibited  in  their  tendency  toward  a central  value) 
that  was  important,  rather  than  the  strength  of  the  group’s  conviction  (prior  variance)  about  the 
mean  value.  < 

I 2.  The  permissible  amount  to  spend  on  testing  was  sensitive  to  both  the  prior  mean  and  prior  | 

variance.  The  implication  of  this  was  that  the  prior  estimates  should  represent  the  most  informed  i 

judgment  available.  However,  since  testing  expenditures  were  found  to  be  permissible  over  a wide  i 

range  of  prior  probabilities,  it  was  also  clear  that  the  Bayes’  Decision  Procedure  could  result  in  | 

a higher  quality  decision  even  when  the  prior  estimates  represented  a low  order  of  prior  knowledge.  i 

[ 3.  The  permissible  amoimt  to  spend  on  further  development  varied  inversely  with  the  discount 

I rate,  and  the  permissible  amount  to  spend  on  testing  varied  directly  with  the  discoimt  rate.  That 

I is  to  say,  as  the  discount  rate  increased,  the  tendency  was  to  spend  less  on  product  improvement 

L and  more  on  product  testing. 

4.  The  alternative  of  both  improving  and  testing  the  modification  procedure  before  application 
dominated  the  other  four  alternatives  for  all  discount  rates  examined  (0%  to  35%).  In  addition, 
the  alternative  of  testing  the  modification  procedure  without  further  improvement  before  applica- 
tion dominated  the  alternative  of  neither  improving  nor  testing  the  procedure.  This  later  finding 
I was  particularly  interesting,  since  it  indicated  that  expected  system  cost  could  be  lowered  by 

^ investing  only  in  efforts  to  reduce  imcertainty.  That  is,  even  if  no  further  improvement  in  the 

I modification  procedure  was  made,  it  was  still  economically  advisable  to  undertake  a testing  program. 

The  results  of  the  analysis  indicated  that  both  improvement  and  testing  should  be  undertaken 
and  that  the  maximum  amounts  that  should  be  spent  on  the  programs  were  $5.29  million  and 
$0.19  million,  respectively.  As  indicated  previously,  the  actual  amount  to  spend  on  each  program 
was  not  addressed  since  that  would  have  involved  an  examination  of  the  economies  exhibited 

by  the  specific  development  and  testing  programs  used,  which  was  not  done  until  later.  However,  ] 

the  analysis  showed  that  to  the  extent  it  was  possible  to  conduct  the  development  and  testing  ! 

programs  for  amounts  less  than  the  maximums  permissible,  substantial  savings  could  be  realized 
by  postponing  application  of  the  procedure  to  permit  it  to  be  improved  and  tested. 

The  results  of  the  analysis  stimulated  management  in  at  least  two  ways:  first,  to  secure  the 
most  reputable  engineers  available  to  estimate  the  priors,  and  then  to  undertake  both  development 
and  testing  as  described  in  alternative  five.  The  development  and  testing  programs  were  actually 

conducted  for  amounts  less  than  the  maximums  permissible  and,  as  a result,  sizeable  savings  were  j 

I realized.  ] 
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ABSTRACT 


This  paper  deals  with  the  problem  of  scheduling  items  (tasks,  employees, 
equipment,  etc.)  over  a finite  time  horizon  so  as  to  minimize  total  cost  expenditures 
while  maintaining  a predefined  separation  between  certain  items.  The  problem  is 
cyclic,  because  the  same  schedule  will  be  repeated  over  several  consecutive  time 
periods  of  equal  length.  Thus,  requirements  are  present  to  maintain  the  separation 
of  items  not  only  within  the  individual  time  periods  considered,  but  also  between 
items  in  adjoining  periods.  A special  purpose  branch-and-bound  algorithm  is  devel- 
oped to  solve  this  scheduling  problem  by  taking  advantage  of  its  cyclic  nature. 
Computational  results  are  given. 


1.  INTBODUCTION 

Through  the  years,  the  classical  assignment  problem  has  had  numerous  offspring.  These  include 
the  multidimensional  assig^nment  problem  [10],  the  quadratic  assignment  problem  [6, 7, 9],  the  travel- 
ing salesman  problem  [8],  and  the  generalized  assignment  problem  [11].  This  paper  adds  the  cyclic 
separation  scheduling  problem  to  the  list  of  progeny. 

The  Separation  Scheduling  Problem  (SSP)  consists  of  assigning  items  (tasks,  employees, 
equipment,  etc.)  in  such  a way  that  a predefined  distance  is  maintained  between  certain  items. 
We  define  to  be  the  separation  matrix,  where  an  element  «>,==««  specifies  that  item  j must  be 
separated  from  item  q by  positions.  For  the  purpose  of  establishing  terminology  and  to  lead 
to  the  application  where  our  work  is  currently  being  directed,  we  assume  that  n items  are  avaUable 
for  wheduling  on  m da>s.  One  and  only  one  item  may  be  assigned  on  any  day,  and  the  number 
of  times  an  item  may  be  scheduled  during  the  m days  is  limited.  When  the  objective  is  to  minimize 
the  total  cost  of  assigning  the  items,  the  separation  scheduling  problem  can  be  formulated  as  the 
following  integer  programming  problem: 


•Thl«  study  was  supported,  in  part,  by  the  Office  of  Naval  Research. 
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m « 

(1) 

subject  to 

Minimize  S S 

i-i  y-i 

(2) 

n 

m • 

(3) 

(4) 

Xii+Xrt<l  if  Sj^ydir, 

r=l,  2,  . . m-1; 

t=r+l,  r+2 m; 

i=l,  2 n\ 

g=l,  2,  . . .,n; 

(5)  a:,y=Oorl  i=l,  2 m; 

2,  . . .,n, 

where 

(1,  if  item  j is  scheduled  on  day  i 
0,  otherwise, 

dt,=i—r,  Cif  is  the  cost  of  assigning  item  j on  day  i,  and  u,  is  the  maximum  number  of  times 
item  j may  appear  during  m days. 

This  problem  was  formulated  and  solved  by  Balintfy  [2]  by  using  an  algorithm  similar  to  that 
of  Little,  et  al.  [8]  to  solve  the  traveling  salesman  problem.  The  problem  occurs  in  the  context  of 
scheduling  menu  items  over  the  days  of  the  planning  horizon,  where  the  purpose  of  separation 
constraints  is  to  ensure  variety  in  the  schedule.  It  is  conceivable,  however,  that  separation  scheduling 
has  the  opposite  goal — to  ensure  proximity  of  certain  items  within  specified  levels.  The  modifica- 
tions to  our  algorithm  to  account  for  the  latter  aspect  will  be  discussed  in  the  conclusions. 

In  menu  plans  [1],  if  a schedule  covering  tn  days  is  thought  of  as  being  repeated  every  m days, 
the  item  allocated  on  day  1 will  be  the  same  as  the  item  on  day  the  item  on  day  2 will  be  the 
same  as  that  on  day  m-|-2,  and  so  on.  This  cyclic  property  implies  that,  if  item  A is  constrained  to 
be  separated  from  item  B by  at  least  two  days,  and  item  A appears  on  day  1 of  the  schedule,  then 
item  B should  not  appear,  not  only  on  days  2 and  3,  but  also  on  days  m and  m—  1.  Such  a schedule 
can  be  conceptualized  as  the  allocation  of  items  to  equispaced  points  on  the  circumference  of  a circle, 
and  the  separation  between  two  points  is  then  the  minimum  of  the  distances  between  them  in  the 
clockwise  and  counterclockwise  directions.  More  formally,  the  distance  between  discrete  points 
i and  r,  is  defined  by 

(6)  d„  = min  {(i  — r),  m— (i— r)}. 

An  additional  feature  of  the  problem  is  that  the  cost  of  an  item  depends  only  on  the  item,  and 
not  on  where  it  is  scheduled.  Thus,  if  a cost  matrix  (7  is  defined  such  that  c</  is  the  cost  of  assigning 
item  j to  day  i,  the  rows  of  the  cost  matrix  are  identical;  i.e.,  Ct,=c,i  for  all  i,  r,j.  It  is  this  property. 


CYCLIC  SEPARATION  SCHEDULINQ  fill 

along  with  the  cyclic  nature  of  the  separation  constraints,  which  is  used  to  defined  the  Cyclic 
Separation  Scheduling  problem  (CSSP): 


(7) 


Minimize  ^ ^ subject  to  (2) — (5), 


where  the  distance  function  is  now  defined  by  (6),  and  the  separation  matrix  S is  defined  as  before. 

The  algorithm  of  Balintfy  used  to  solve  the  SSP  applies  directly  to  the  CSSP,  as  the  CSSP  is 
only  a special  case  of  the  SSP.  However,  no  advantage  is  taken  of  the  cj^clic  property  of  the  separa- 
tion constraints  and  the  special  structure  of  the  cost  matrix.  The  objective  addressed  here  is  to 
exploit  the  aforementioned  properties  in  developing  a more  efiScient  solution  technique. 


2.  SEPARATION  SCHEDULING 

The  fundamental  features  of  the  parent  branch-and-bound  algorithm  developed  and  imple- 
mented by  Balintfy  are  recapitulated  in  this  section. 

A branch-and-boimd  algorithm  involves  breaking  up  the  set  of  all  solutions  into  smaller  and 
smaller  subsets  and  finding  lower  bounds  on  the  costs  of  the  subsets  until  a single  solution  is  found 
with  a cost  less  than  or  equal  to  the  lower  bounds  on  the  costs  of  all  other  subsets  of  solutions. 
Consequently,  in  any  branch-and-bound  algorithm,  a set  of  rules  is  required  for  (a)  branching, 
i.e.,  partitioning  solution  sets,  (b)  determining  lower  bounds  on  sets  of  solutions,  (c)  selecting  the 
set  of  solutions  to  partition  next,  and  (d)  recognizing  infeasible,  suboptimal,  or  optimal  solutions. 

In  the  case  of  the  separation  scheduling  problem,  a feasible  solution  is  represented  by  a set 
of  ordered  pairs:  (ij.Jt),  ■ • .,  {i»,  j,,)}.  The  inclusion  of  pair  (i,j)  in  this  set  means  that  item 

j is  scheduled  to  appear  in  time  period  i.  A set  of  solutions  will  be  represented  by  H^(S),  where  S' 
is  a set  of  ordered  pairs  characterizing  the  solution  set,  and  consists  of  (a)  a subset  of  ordered  pairs 
S representing  allocations,  and  (b)  a subset  of  ordered  pairs  S representing  prohibitions.  If  (i,  j)  is 
in  S,  this  means  that  every  solution  in  W(S)  must  have  item  j scheduled  on  day  i,  and  if  (t,  j)  is  in 
5,  item  J must  be  barred  from  day  i in  every  solution  in  1^(5).  The  bars  are  placed  under  or  over  the 
ordered  pair  to  indicate  which  part  of  S the  element  belongs,  to.  When  no  bar  appears,  the  ordered 
pairs  referred  to  may  be  either  from  S'  or 

In  the  separation  scheduling  algorithm,  to  partition  solution  sets,  an  ordered  pair  (i,  j)  is 
selected  as  the  partitioning  element  such  that  for  the  solution  set  W(S)  being  partitioned,  (i,  j) 
is  not  in  S.  Then  W(S,)  and  W(Sa)  constitute  the  partition  of  W(S),  where  Si=Sf/(t7j)  and  Sj= 
j)-  Thus,  W(Si)  represents  the  subset  of  solutions  from  W(S)  which  have  item  j barred  from 
being  on  the  ith  day,  and  W(Si)  represents  the  subset  of  solutions  of  W(S)  which  have  itemj 
scheduled  on  the  ith  day.  This  method  of  partitioning  is  similar  to  the  algorithm  of  Little,  et  al.  to 
solve  the  traveling-salesman  problem. 

The  process  of  branching  is  conveniently  represented  by  a tree,  with  the  nodes  representing 
sets  of  solutions.  A branch  connects  node  N and  Nt  of  the  tree  when  the  solution  set  W(iS0  repre- 
sented by  N contains  the  solution  set  W(iSi)  represented  by  Ni.  L{N)  will  then  denote  a lower 
bound  on  the  value  of  the  objective  function  for  all  the  solutions  represented  by  node  N.  No  further 
partitioning  is  required  at  node  iV  when  L{N)  is  greater  than  the  objective  function  value  associated 
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with  any  known  feasible  schedule.  In  such  a case,  the  node  is  said  to  be  fathomed.  Fathoming  also 
takes  place  when  either  a feasible  schedule  is  achieved  or  it  is  ascertained  that  no  solution 

can  be  found  in  W(S). 

The  features  of  the  SSP  which  make  it  a restricted  assignment  problem  are  the  separation 
constraints.  These  constraints  are  utilized  in  the  algorithm  tc  nifcintAin  ^ separation  between 
certain  items.  No  allocations  are  permitted  which  would  violate  the  constraints  of  the  problem. 

A statement  of  the  algorithm  that  contains  a detailed  description  of  the  rules  used  in  our 
implementation  is  given  in  section  4. 


S.  CYCLIC  SEPARATION  SCHEDUUNG  PROBLEM 


In  branch-and-bound  algorithms,  the  terminal  nodes  of  the  solution  tree  represent  the  set  of 
aU  solutions.  In  the  CSSP,  very  early  in  the  branching  process  it  is  possible  to  eliminate  whole 
subsets  of  solutions  from  consideration.  With  the  cyclic  property  of  the  problem  in  mind,  the 
following  definitions  are  given. 

DEFINITION:  A schedule  is  a trantlation  of  a schedule  X*  if  t=l,  2,  . . ., 

2,  . . .,  n,  for  some  constant  integer  k,  0<k<m,  where 


(8) 


i®k= 


f»+*. 

i+i— m. 


t+i<m 


DEFINITION : A schedule  Jf'  is  a mirror  image  of  a schedule  X*  about  position  k if 
i=0,  1,  2,  . . .,  m— l;j=l,2,  . . .,n;  where  i 0 it  is  defined  by  (8),  and 


k9i= 


fi-t. 


i-i>0 


(m+(jfc— i),  t— i<0. 


Consider,  for  example,  the  following  three  solutions  to  a seven-day  CSSP: 

Day  Solution  1 Solution  2 Solution  3 


1 

2 

3 

4 
6 
6 
7 


A 

B 

A 

C 

D 

A 

B 


D 

A 

B 

A 

B 

A 

C 


B 

A 

D 

C 

A 

B 

A 


Notice  that  Solution  2 can  be  obtained  from  Solution  1 by  shifting  all  the  allocations  three  days 
ahead.  Thus,  Solution  2 is  a translation  of  Solution  1.  Also,  Solution  3 can  bo  obtained  from  Solution 
1 by  taking  a mirror  image  of  the  allocations  about  day  4. 

DEFINITION : Two  schedules  X'  and  X*  are  eguimUerU  if  either  X*  is  a translation  of  X', 
or  a third  schedule  exists  that  is  a translation  of  X'  and  a mirror  image  of  X*. 

Thus,  Solutions  1,  2,  and  3 listed  above  are  all  equivalent.  When  showing  the  equivalence 
between  Solution  1 and  Solution  2,  the  third  solution  mentioned  in  the  definition  is  given  by  k=0. 
In  other  words,  the  trivial  translation  gives  back  Solution  1. 
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This  definition  of  equivalence  among  solution  sets  preserves  (a)  the  frequency  of  each  item 
on  the  schedule,  and  (b)  the  separation  in  days  between  item  pairs.  Therefore,  infeasibiliy,  sub- 
optimality, or  optimality  of  one  implies  infeasibility,  suboptimality,  or  optimality  of  the  other, 
respectively. 

DEFINITION:  A dormant  node  is  a node  corresponding  to  a solution  set  characterized  by 
prohibitions  only. 

In  terms  of  the  solution  tree,  a dormant  node  is  one  which  is  connected  to  the  start  of  the 
tree  through  branches  corresponding  to  prohibitions  only.  Thus,  if  a node  N corresponding  to  a 
solution  set  W^(S)  is  characterized  by  S=  |S,  S},  then  S=  $,  i.e.,  5 is  empty.  It  will  be  shown  that 
recogmtion  of  a dormant  node  is  instrumental  in  recognizing  translations  of  solution  sets,  and 
that  by  utihzing  this  recogmtion  we  can  obtain  higher  bounds  for  the  node,  as  well  as  fewer  branches 
for  the  solution  tree. 

DEFINITION : A bare  node  is  a node  representing  a solution  set  such  that,  in  tracing  back 
to  the  start,  the  first  branch  encountered  corresponds  to  a prohibition,  and  there  is  only  one  branch 
corresponding  to  an  allocation. 

In  this  case,  the  solution-characterizing  set  Ss  |S,  S)  is  such  that  the  cardinality  of  S'  is  one. 
It  will  be  shown  that  recognition  of  a bare  node  is  instrumental  in  recognizing  mirror  images  of 
solution  sets. 

First,  consider  the  case  when  the  first  partitioning  of  the  solution  set  is  being  performed.  The 
solution  tree  at  this  stage  will  look  as  follows: 


If  the  cycle  time  of  the  schedule  is  m,  there  exist  m—\  sets  of  solutions  equivalent  to  the  one 
represented  by  node  3 of  the  above  figure.  The  set  of  equivalent  solution  sets  are  characterized 
by  Since  all  these  solution  sets  are  equivalent,  any  one  of  them 

can  represent  all  of  them.  So  we  must  prohibit  all  except  one,  e.g.  { (i,  j) } . Further,  it  may  be  noticed 
that  the  set  of  solutions  represented  by  node  2,  i.e.,  characterized  by  {(1,^)},  is  the  set  of  solutions 
that  includes  all  the  solutions  from  the  sets  equivalent  to  the  set  characterized  by  { (i,  j) } , except 
for  those  which  must  have  element  {i,j)  in  them.  But  this  particular  subset  of  the  equivalent 
solution  sets  is  already  included  in  the  solution  set  represented  by  node  3.  Hence,  a way  to  eliminate 
further  consideration  of  the  sets  of  solutions  equivalent  to  those  represented  by  node  3 is  to 
eliminate  them  from  node  2.  This  can  be  done  easily,  not  only  by  letting  node  2 represent  the  set 
of  solutions  in  which  item  j is  prohibited  from  being  scheduled  on  day  i,  but  also  by  adding  the 
additional  constraint  on  node  2 that  item  j must  not  be  scheduled  on  any  day.  In  this  way,  con- 
sideration of  all  subsets  of  solutions  equivalent  to  those  represented  by  node  3 will  be  eliminated. 
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Without  consideration  of  translations  of  solution  sets,  partitioning  was  done  in  the  following 
manner: 

Either  item  j is  scheduled  on  day  i, 

Or  item  j is  barred  from  being  scheduled  on  day  i. 

Consideration  of  translations  of  solution  sets  results  in  a dichotomy  of  the  following  kind : 

Either  item  7 is  scheduled  at  least  once,  e.g.,  on  day  i, 

Or  item  j is  completely  eliminated  from  the  schedule. 

Elimination  of  an  item  j from  the  schedule  will  be  represented  by  (7).  So  far,  we  have  been  talking 
only  of  the  first  dichotomy  performed  in  the  branching  process.  This  concept  is  easily  extended  to 
subsequent  partitions,  provided  certain  conditions  are  met.  Suppose  some  node  corresponds  to 
prohibitions  only,  i.e.,  it  is  a dormant  node,  and  that  each  prohibition  corresponds  to  the  elimina- 
tion of  the  item  from  the  problem,  in  line  with  the  partitioning  scheme  mentioned  above.  Thus, 
the  node  corresponds  to  solution  sets  in  which  some  items  have  been  taken  out  of  consideration. 
This  is  clearly  a subproblem  of  the  original  problem,  with  some  items  eliminated,  and  hence,  one 
can  proceed  as  if  solving  just  a smaller  separation  scheduling  problem. 


Now  consider  the  simplest  case  of  a bare  node. 

In  the  above  sketch,  node  4 is  a bare  node  because  by  tracing  back  to  node  1 from  node  4,  the 

first  branch  encountered  has  a prohibition,  i.e.,  (]E,  r),  and  the  only  other  branch  has  an  allocation,  : 

i.e.,  (i,  jf).  If  node  5 represents  the  solution  set  W(S),  by  the  previous  discission,  an  inversion  of  i 

W(S),  e.g.,  W(iSi),  can  be  constructed  by  taking  the  mirror  images  of  all  allocations  and  prohibi-  i 

tions  about  day  i.  But  W(iSi)  is  included  in  the  solutions  represented  by  node  4.  Hence,  as  in  the  j 

case  of  dormant  nodes,  a way  of  prohibiting  consideration  of  W(iS'i)  is  to  eliminate  it  from  node  4.  ^ 

This  is  easily  achieved  by  taking  the  mirror  image  of  (k,  r)  about  day  i,  and  if  this  is  (g,  r),  by 
letting  node  4 have  the  extra  prohibition  (g,r). 

A special  case  arises  when  k=g.  Equivalence  cannot  be  exploited  here,  as  the  solution  set  < 

is  its  own  mirror  image.  ] 
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4.  IMPLEMENTATION  AND  COMPUTATIONAL  EXPERIENCE 

The  algorithms  outlined  here  to  solve  the  SSP  and  CSSP  have  been  coded  in  FORTRAN  for 
the  CDC6000  computer.  A discusrion  of  how  the  fundamental  aspects  of  the  algorithms  were 
implemented  follows. 

Coutraetioii  of  the  Sointion  Tree 

A direct  descent  [12]  approach  was  employed  to  execute  the  partitioning  process.  Two  varia- 
bles and  one  array  were  required  to  completely  define  the  solution  tree  and  the  porition  on  the 
tree  at  any  stage.  One  variable  identified  the  current  day  and  the  other  variable  identified  the 
level  in  the  tree  of  the  node  currently  under  consideration.  The  array  contained  the  indices  of  those 
items  which  received  either  an  allocation  or  a prohibition  on  the  direct  path  from  the  node  cur- 
rently under  consideration  to  the  initial  node.  A negative  index  indicated  a prohibition  and  a 
positive  index  indicated  an  allocation.  Prohibitions  or  allocations  were  always  made  on  the  days 
in  a sequential  manner  beginning  at  day  1,  aUocations  being  assigned  to  the  least  cost  item  free 
on  that  day.  The  code  continued  to  make  allocations  until  fathoming  took  place.  After  fathoming, 
the  code  backtracked  up  the  tree  until  a node  with  an  allocation  was  encountered.  At  that  point, 
the  item  allocated  at  that  day  was  prohibited,  and  the  branching  process  continued  down  the  tree 
until  fathoming  again  took  place. 

This  type  of  construction  b termed  direct  descent  because  the  method  of  branching  is  pre- 
determined; that  is,  no  search  of  the  tree  is  required  to  decide  the  item  and  the  day  to  be  consid- 
ered next.  The  direct  descent  approach  seemed  very  appropriate  here  because  so  few  computations 
were  involved  in  moving  from  one  node  in  the  tree  to  an  adjacent  node. 

Satiafyiaf  the  Conetrainta 

An  m 1^  n matrix  and  an  n-dimensional  vector  were  utilized  to  guarantee  satisfaction  of  the 
separation  and  upper  bound  constraints,  respectively.  The  matrix  was  initialized  to  zero,  and  the 
jth  position  of  the  array  was  assigned  the  upper  bound  on  the  number  of  times  item  j could  appear 
during  the  schedule.  When  item  j was  allocated  on  day  i and  item  h had  to  be  separated  from  i by 
d days,  a one  was  added  to  column  h in  locations  itd  through  t ft*  d.  Also,  when  item  j was  allocated, 
a one  was  subtracted  from  the  ^th  element  of  the  array.  Thus,  at  any  stage,  item  n could  only  be 
allocated  on  day  k if  the  {k,  r)th  element  of  the  matrix  was  zero  and  the  rth  element  of  the  array 
was  not  zero.  When  backtracking  and  a previously  allocated  item  became  prohibited,  the  process 
was  reversed. 

Determlniiig  Boands 

A lower  bound  on  the  best  solution  to  be  found  in  TF(5)  was  obtained  by  solving  a relaxation 
of  the  problem  conditioned  on  the  assignments  made  up  to  that  stage.  A pass  was  made  from  the 
day  currently  under  cmisideration  onward  to  day  m,  which  temporarily  allocated  the  least  cost 
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item  available  on  each  day  by  considering  only  the  separation  constraints  incurred  as  the  result 
of  allocations  made  up  to  the  current  day.  If  the  total  number  of  allocations  (both  temporaiy  and 
forced  by  branches  above)  for  an  item  exceeded  the  upper  bound  on  the  item,  the  total  number  of 
times  the  item  was  allocated  dropped  back  to  the  upper  bound,  and  the  difference  was  added  to 
the  number  of  times  that  the  next  least  cost  item  was  allocated.  A lower  bound  L{N)  for  node  N 
was  then  given  by  summing  the  product  of  the  number  of  times  an  item  was  allocated  and  the 
item’s  cost. 

The  difference  between  the  SSP  and  CSSP  computer  codes  in  our  implementation  consists 
of  checks  to  eliminate  translations  and  mirror  images  of  solutions  already  considered.  This  yields 
better  bounds,  as  more  prohibitions  exist  on  one  side  of  the  tree.  With  allocations  being  made 
sequentially  over  the  days,  translations  as  recognizable  through  dormant  nodes  occur  only  on  day 
1,  and  mirror  images  as  recognizable  through  bare  nodes  occur  only  on  day  2. 


TABLE  I.  Computational  ReauUa  With  the  SSP  and  CSSP  Algorithms  Solving  Fifteen  Randomly 

Oenerated  Test  Problems 


Problem  No. 

Problem  Size 

Time  {CP  seconds) 

m 

n 

SSP 

CSSP 

1 

11 

30 

>360t 

22.06 

2 

7 

30 

21.  70 

1. 14 

3 

7 

30 

68.90 

2.66 

4 

7 

30 

23.69 

1.07 

5 

14 

40 

♦ 

98.49 

6 

8 

70 

112.  10 

4.64 

7 

9 

40 

36.00 

1.87 

8 

10 

30 

70.82 

5.32 

9 

21 

45 

• 

191.31 

10 

7 

60 

2.90 

.20 

11 

7 

60 

1.68 

. 14 

12 

10 

20 

50.58 

2.49 

13 

10 

20 

>300t 

10.  23 

14 

14 

60 

242.  36 

16 

21 

60 

♦ 

232.  51 

*Indicate8  problem  not  attempted  with  SSP  algorithm. 

t>N  indicates  optimality  not  verified;  run  terminated  after  N seconds. 


To  test  the  efficiency  of  the  SSP  and  CSSP  computer  codes  and  to  compare  the  two,  fifteen 
sample  problems  were  randomly  generated.  All  sample  problems  had  a separateon  matrix  approx- 
imately 40%  dense,  and  the  separation  distance  varied  with  the  input  parameters.  Costs  were 
uniformly  distributed  between  n and  n+bn,  70%  of  the  upper  bounds  were  1,  25%  were  2,  and  the 
remainder  were  3.  The  test  runs  were  made  on  the  CDCOOOO  computer  system  at  the  University 
Tesas  at  Austin,  and  all  time*  are  CPU  seconds  required  to  find  the  optimal  solution  and  prove 


CYCLIC  SGFABATION  SCHEDULING  517 

optimality.  As  can  be  seen,  the  CSSP  algorithum  performed  remarkably  better  in  every  instance; 
in  fact,  no  attempt  was  made  to  solve  the  four  larger  problems  with  the  SSP  algorithm. 

One  modification  of  the  algorithm  was  tested  to  see  what  effect  better  bounds  would  have  on 
solution  times.  A transportation  problem  [3]  was  solved  at  each  node  to  obtain  a lower  bound  uni- 
formly stronger  than  that  obtained  with  the  method  previously  described.  Because  of  the  cyclic 
nature  of  the  separation  constraints,  the  resulting  transportation  problem  was  not  always  trivial. 
The  steps  saved  in  branching  did  not  compensate  for  the  additional  time  required  to  solve  the 
transportation  problem,  and  poorer  overall  times  resulted. 

5.  CONCLUSIONS 

This  paper  has  presented  a special  purpose  branch-and-bound  algorithm  which  takes  ad- 
vantage, through  the  concepts  of  translation  and  mirror  images,  of  the  cyclic  separation  scheduling 
problem’s  structure.  It  may  also  be  possible  to  take  a similar  advantage  when  solving  other  cyclic 
problems.  One  trivial  extension  involves  a scheduling  problem  where  an  item  must  be  within  a 
certain  distance  of  another  item  when  both  are  scheduled.  The  only  modification  comes  in  preserving 
feasibility,  where,  in  our  implementation,  the  locations  in  the  matrix  to  be  incremented  or  de- 
cremented by  1 would  change. 

In  addition  to  the  menu-planning  application  described  in  some  detail  by  Balintfy  [2], 
the  cyclic  separation  scheduling  model  may  be  useful  in  an  advertising  campaign.  In  this  context, 
several  commercials  must  be  scheduled  over  a specified  period  to  maximize  consumer  appeal.  A 
modification  to  the  model  would  arise  when  certain  commercials  compliment  each  other  and,  if 
scheduled  at  all,  should  be  scheduled  within  a specified  time  period  of  one  another.  This  would 
require  the  algorithm  to  combine  the  techniques  discussed  in  the  previous  paragraph  with  those 
described  in  Section  4. 
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A PRIMAL  SIMPLEX  ALGORITHM  TO  SOLVE  A RECTILINEAR 
DISTANCE  FACILITY  LOCATION  PROBLEM 


Ronald  D.  Armstrong 

Th»  UniMnity  of  Ttxat 
AtuHn,  Ttxa* 


ABSTRACT 

This  p^>er  considers  the  problem  of  locating  m new  facilities  in  the  plane 
so  as  to  minimise  a weighted  rectangular  distance  between  the  new  facilities  and 
n existing  facilities.  A special  purpose  primal  simplex  algorithm  is  developed  to 
solve  this  problem.  The  algorithm  will  maintain  at  all  times  a basis  of  dimension 
m by  m;  however,  because  of  the  triangularity  of  the  basis  matrix,  it  will  not  be 
necessary  to  form  a basis  inverse  explicitly. 

INTRODUCTION 

The  problem  considered  in  this  paper  involves  assigning  positions  in  the  plane  for  the  location 
of  m new  facilities  so  as  to  minimize  a weighted  rectangular  distance  between  the  new  facilities  and 
n existing  facilities.  This  problem  has  been  studied  by  Cabot,  Francis,  and  Stary  [3],  Wesolowsky 
and  Love  [15],  and  Morris  [llj.  These  articles  propose  a solution  by  converting  the  problem  to  a 
linear  program  and  then  utilizing  the  simplex  algorithm.  They  also  give  references  to  related  non- 
simplex  work  in  the  area.  Additional  references  for  location  theory  are  given  in  a review  article  by 
Francis  and  Goldstein  [6]. 

The  rectangular  distance  facility  location  problem  can  be  stated  mathematically  as 

«•  «+«•  m-l  ■ 

(1)  Minimize  «u(l*H-o,y|  + |z„-a„|) + ^ j„(|xh-z„|  + |z«-Xi,l), 

where  (zu,  Xti)  is  the  location  of  the  tth  new  facility  in  the  plane,  (oiy,  Oi^)  is  the  location  of  the 
jth  existing  facility  in  the  plane,  and  is  a positive  weight  on  the  rectilinear  distance  between 
facilities  i and  j. 


A more  general  statement  of  the  problem  of  finding  parameters  to  minimize  the  rectangular 
distances  between  given  points  and  associated  linear  functions  of  these  parameters  is  given  by 


620 


R.  D.  ARMSTRONG 


where  the  scalar  6y  and  the  t-dimensional  vector  df  are  given  for  all  j,  and  x is  the  vector  of  param- 
eters which  must  be  determined.  Here,  the  positive  weights,  if  present,  are  brought  inside  the 
absolute  value  sign  and  are  reflected  in  6y  and  dj.  Two  common  names  given  to  (2)  are  least  absolute 
deviations  curve-fitting  problems  and  A norm  regression  problems. 

Applications  which  give  rise  to  problem  (2)  are  found  in  statistics  [2],  numerical  analysis  [12], 
and  management  science  [4,  5, 10).  Chames,  Cooper,  and  Ferguson  [5]  were  the  first  to  utilize  linear 
programming  to  solve  (2),  and  it  is  generally  recogifized  that  this  is  the  most  efficient  method  to 
solve  the  problem  [1,  13].  A linear  programming  equivalent  of  (2)  is 

(3)  Minimize  ^ 

subject  to 

hi—di^x—P)-]rNf=Q, 

P)>Q  and  Nf>Q,  j=l,  2, 

Wagner  [14]  demonstrates  how,  upon  taking  the  dual  of  (3),  the  problem  becomes  one  with 
upper  bound  restrictions  on  all  the  dual  variables  and,  hence,  would  require  the  working  basis  to 
be  a t by  ( matrix.  This  makes  the  problem  much  more  tractable  with  existing  linear  programming 
codes,  because,  generally,  t will  be  substantially  less  than  s.  It  is  an  adaptation  of  Wagner’s  approach 
that  is  used  by  Cabot,  Francis,  and  Stary  [3]  in  reducing  (1)  to  a network  problem. 

Because  of  the  special  structure  of  (3),  it  can  also  be  solved  with  a primal  algorithm  employing 
a working  basis  of  size  ( by  (.  Barrodale  and  Roberts  [1]  and  Spyropoulos,  Eaountouzis,  and  Young 
[13]  report  superior  computational  results  in  solving  the  primal  problem  directly  with  a special 
piuqK>se  algorithm.  An  extension  of  such  an  algorithm  will  be  presented  here  to  solve  (1). 


A UNEAR  PROGRAMMING  FORMULATION 


An  initial  observation  concerning  (1)  is  that  it  decomposes  immediately  into  two  disjoint 
problems — one  for  each  coordinate  in  the  plane.  Because  of  this,  we  will  now  drop  the  double 
subscript  notation  on  X and  a,  and  refer  to  the  problem. 

m R+m  m— 1 m 

(4)  Minimize  ^ ^ 9.ti\Xi—X,\. 


Cabot,  Francis,  and  Stary  [3]  show  that  (4)  is  equivalent  to 


(5) 

subject  to 


Minimize  ^ 

Xt-^-Pt—Nn^a,,  i=l,  2 m 

;=m+l,  m-1-2, . . .,  n+m; 
Xi — Xj-\-Pif — Ni,=0,  i=l,  2, . . . m — 1; 

i='i+l m; 

N,i>Q  and  P«>0,  for  all  (i,  j), 
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where  N„  and  P,j  are,  respectively,  the  negative  and  positive  absolute  deviation  of  facility  i from 
facility  j in  the  coordinate  under  consideration. 

It  can  be  shown  [3]  that  the  dual  of  (5)  is  a capacitated  network  problem,  once  additional 
variables  and  constraints  have  been  added.  A primal  algorithm  will  be  presented  here  to  solve  a 
slightly  modified  version  of  (5).  Although  all  the  results  given  here  can  be  obtained  by  considering 
only  (6),  it  is  felt  that  the  following  equivalent  problem  makes  the  development  more  intuitive. 


(6) 

subject  to 


Minimize 


» «+• 


Nt,,  i = l,  2,  . . .,  m; 

j=m  + l,  m + 2, . . .,  m+n; 
P 2, . . .,  m — 1; 

i=i+l m; 


Nf)  > 0 and  P^  > 0 for  every  (i,  j) . 


In  matrix  notation,  the  constraints  of  (6)  can  be  written 


(7) 


-p<Ax<m+iv, 


N>0  and  P>0. 


It  is  easUy  shown  [1,  13]  from  fundamental  linear  programming  theory  that  an  optimal  basis 
for  (6)  (or  (5))  will  have  all  mXj’s  in  the  basis,  and  at  least  m constraints  will  have  Py=^y=0. 
Furthermore,  the  submatrix  of  A formed  from  these  m constraints  will  be  linearly  independent  and, 
once  they  have  been  chosen,  the  remaining  basic  variables  can  be  determined.  To  illustrate  this, 
we  partition  (7)  in  the  following  manner: 


where  P is  an  m by  m nonsingular  matrix  corresponding  to  the  m constraints  previously  mentioned. 
We  now  make  the  transformation  z=BX,  and  define  a new  equivalent  problem: 


^8) 

subject  to 


m n+m 


Minimize  g qij(Ptj+N„), 


A^>0  and  P>0. 


Since  Pg=Na=0,  the  current  “basic"  solution  has  'z=Fb,  X=B~^'S\  Ptj  and  Nij  are  determined 
from  the  constraints  while  making  the  deviations  as  small  as  possible. 

We  will  now  demonstrate  how  several  standard  simplex  pivots  may  be  combined  into  one 
pivot.  We  initially  choose  a B which  is  our  working  basis  and  define  two  index  sets 
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where  R is  the  index  set  of  in  P«  (or  in  Nb).  In  the  case  of  degeneracy  (Pii—Ntj=0),  (i,  j) 
may  be  assigned  initially  to  either  / “ or  I'*'.  The  objective  value  associated  with  this  basis  is 


S S ^ QijiPu~RtjR  S iniRijR 

(ijJil*  (tjhl-  (ijTil*  (ijjil- 


where  Rtj  is  a row  of  R and  is  an  element  of  F;  that  is,  P</=0  or  a/.  It  is  now  seen  that  by  vary- 
ing an  element  of  2,  e.g.  2^,  away  from  its  current  value  It  by  S,  and  leaving  the  remaining  elements 
of  2 fixed,  the  objective  value  will  change  by 

(9) 


S S{jRi/Rt~'^ — i Aid- I 


where  Rt~‘  is  the  itth  column  of  R~‘  and  is  the  weight  associated  with  the  ibth  row  in  the 
working  basis.  In  other  words,  if  |At|  is  greater  than  then  the  objective  can  be  decreased  by 
varying  Zt  in  the  direction  prescribed  by  the  sign  of  At  (At<0  indicates  an  increase,  and  At>0 
indicates  a decrease). 

The  amount  that  Zt  can  vary  away  from  It  before  the  rate  of  change  in  the  objective  becomes 
nonegative  is  given  by  ordering  the  following  ratios: 


Pii  — 


-sgn  (£)R,yBt-‘’ 


sgn 


(A.)P.7g;-‘’  ^ (i.  i)  *1- 


Let  L{u)  denote  the  index  set  of  the  rows  yielding  the  u smallest  ratios.  After  |5|  equals  the  mini- 
mum ratio  {pi),  {i,  j)tL{\)),  the  indices  (i,  j)  will  move  from  I*  to  7"  (or  from  I~  to  /"'■)  and  the 
objective  change  given  by  (9)  will  be  updated  accordingly.  However,  the  change  will  be  nonnegative 
until  the  (i,  j)  present  in  L{h)  but  not  in  L(/i-l)  moves  from  one  index  set  to  another,  where 


|A»|-  2 , 2|g„P.,B.-‘|>ai5’, 

lAi;|—  S 2|g«^i?,^P,“'|<giy. 

(U)«X(*) 


The  (i,  j)  row  present  in  L{h)  but  not  in  L{h-\)  will  not  enter  the  working  basis  in  place  of  (a,  $). 
This  iteration  of  the  algorithm  corr''sponds  to  h standard  simplex  iterations. 

The  computational  simplifications  which  are  possible  because  of  the  special  structure  of  the 
A matrix  will  be  discussed  in  the  next  section. 


IMPLEMENTATION  FOR  FACILITY  LOCATION  AND  A SAMPLE  PROBLEM 


If  the  problem  to  be  solved  is  of  the  form  given  by  (6),  an  efiicient  implementation  of  the 
algorithm  outlined  in  the  previous  section  requires  an  adaptation  of  the  techniques  utilized  in 
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solving  network  problems.  Hence,  the  basis  matrix  B will  never  be  inverted,  but  rather  saved  in 
a spanning  tree  format  [7,  8],  and  the  triangularity  of  B will  allow  a direct  solution  of  linear  sys- 
tems involving  B.  In  order  to  facilitate  the  relationship  between  (6)  and  a standard  capacitiated 
network  problem,  it  is  conceptually  convenient  to  add  a dummy  variable  to  the  first  (m) 
(n)  constraints.  These  constraints  would  then  appear  as 

ai-Pi,<Xt-Xo<a)+Nt„  i=l,  2,  . . .,  m,j=m+l,  m+2,  . . m+n, 

with  the  additional  restriction  that  Xo=0.  The  transpose  of  the  augmented  A matrix  now  has  a 
network  structure. 

The  linear  system  B X=Fb  can  be  solved  eflSciently  through  the  use  of  link  list  structures 
[7,  8]  in  the  same  manner  as  the  dual  variables  are  obtained  in  the  standard  network  problem. 
That  is,  X^  (which  will  always  be  at  the  root  of  the  tree)  is  set  equal  to  zero,  and  the  remaining 
variables  are  obtained  by  a single  pass  through  the  tree.  Because  the  sweep  of  the  tree  begins  by 
setting  ^*0=0,  the  necessity  of  self-loop  (slack)  arcs  is  alleviated  and  the  spanning  tree  structure 
is  preserved. 

To  check  for  the  optimality  of  the  current  solution,  and  to  determine  the  row  to  leave  the 
basis,  the  system 

AB=(A„  A„) 

must  be  solved  for  A,  where 

B=  Sq  ifR  y — 22  t)R 

The  values  for  A can  be  obtained  by  working  back  to  the  root  of  the  tree,  and  the  additional  equa- 
tion which  may  be  created  by  the  dummy  column  is  redundant. 

To  demonstrate  the  algorithm,  a sample  problem  with  three  existing  facilities  and  three  new 
facilities  will  be  solved  in  one  coordinate.  The  locations  of  the  existing  facilities  are  given  by  o,=7, 
Oj=ll,  and  01=16.  The  matrix  of  weights,  or  g</s,  is  given  in  Table  1. 


TABLE  1.  Weights  on  the  Rectilinear  Distances  Between  Facilities  in  the  Sample  Problem 

Facility 


1 

Facility 

2 


3 


1 

2 

3 

4 

5 

6 

— 

2 

1 

1 

1 

6 

2 

--- 

1 

1 

1 

3 

1 

1 

--- 

8 

1 

1 

The  initial  basis  { (1,  5),  (2,  5),  (3,  5) ) places  the  three  new  facilities  at  the  location  of  facility  5. 
The  spanning  tree  representation  of  this  basis  and  the  basis  at  other  iterations  is  given  in  Figure  1. 


624 


R.  D.  ARMSTRONO 


FIGURE  1.  (a)  Spanning-tree  representation  of  the  first  three  bases,  (b)  Spanning-tree  representation  of  the  final 
basis.  Although  the  first  three  bases  have  the  same  representation,  the  node  potentials  of  the  tree  are  different. 


With  the  first  basis,  we  have 

6),  (2,  6),  (3,  6),  (1,  2),  (1,  3),  (2,  3)) 

/-={(!,  4),  (2,4),  (3,4)} 

and  A=(Ai,  Aj,  A|)  = (— 8,  —1,  9).  Row  (3,  5)  is  chosen  to  leave  the  basis  and  row  (3,  4)  enters  the 
basis.  In  this  situation,  row  (3,  4)  yields  the  third  smallest  ratio;  hence,  the  first  iteration  of  the 
algorithm  combines  three  standard  simplex  iterations  into  one. 

The  second  basis  is  {(1,  5),  (2,  5),  (3,  4))  and  has 
/+={(!,  6),  (2,6),  (1,2),  (3,6),  (3,6)} 

/-={(!,  4),  (2,4),  (1,3),  (2,3)} 

and  A=(— 6,  1,  —5).  Row  (1,  5)  leaves  the  basis  and  (1,  6)  enters  the  basis.  Row  (1,  6)  yields  the 
second  smallest  ratio;  hence,  this  iteration  combines  two  standard  simplex  iterations  into  one. 

The  third  basis  is  { (1,  6),  (2,  5),  (3,  4) } and  has 
/+={(2,6),  (3,6),  (3,  6)} 

/-={(!,  4),  (1,  6),  (2,  4),  (1,  3),  (2,  3),  (1,  2)} 

and  A=(6,  —3,  —4).  Row  (2,  6)  leaves  the  basis  and  row  (1,  2)  enters  the  basis. 

The  fourth  basis  is  {(1,  6),  (1,  2),  (3,  4)}  and  has 
/+={(2,  6),  (3,6),  (3,6)} 

/-  = {(!,  4),  (1,  6),  (2,  4),  (2,  6),  (1,  3),  (2,  3)) 

and  A=(3,  0,  —4).  This  basis  is  optimal  because  ^16=6,  gii=2  and  gs<=8.  The  optimal  objective 
value  is  69  with  .Y|=16,  .Y,=  16  and 

A detailed  description  of  the  computational  steps  required,  such  as  updating  the  spanning 
tree,  will  not  be  given  here,  but  they  are  analogous  to  those  found  in  capacitated  network  algo- 
rithms. In  particular,  an  eflScient  procedure  for  calculating  the  ratios,  p^,  parallels  the  method  for 
calctilating  the  ratios  in  a dual  simplex  network  code  [9].  This  is  what  one  would  expect,  as  the 
dual  of  (6)  is  a capacitated  network  problem,  and  solving  the  duel  with  a dual  simplex  algorithm 
is  the  same  as  solving  the  primal  with  a primal  simplex  algorithm.  The  advantages  to  our  algorithm 
are:  (a)  a feasible  solution  to  (b)  is  immediately  obtained  by  choosing  any  spanning  tree  and 
(b)  several  simplex  iterations  may  be  combined  into  one. 


BECTILINEAB  DISTANCE  FACILITT  LOCATION 


625 


REFERENCES 

[1]  Barrodale,  I.,  and  F.  D.  K.  Roberts,  “An  Improved  Algorithm  for  Discrete  Li  Linear  Ap- 

proximation,” SIAM  Journal  of  Numerical  Analysis,  10,  839-848  (1973). 

[2]  Barrodale,  I.,  “2^  Approximation  and  the  Analysis  of  Data,”  Applied  Statistics,  17,  51-57 

(1968). 

[3]  Cabot,  A.  V.,  R.  L.  Francis,  and  M.  A.  Stary,  “A  Network  Flow  Solution  to  a Rectilinear 

Distance  Facility  Location  Problem,”  AIIE  Transactions,  2, 132-141  (1970). 

[4]  Chames,  A.,  and  W.  W.  Cooper,  Management  Models  and  Industrial  Applications,  Vol.  I and  II 

(John  Wiley  A Sons,  Inc.,  New  York,  1961). 

[5]  Chames,  A.,  W.  W.  Cooper,  and  R.  Ferguson,  “Optimal  Estimation  of  Executive  Compensa- 

tion by  Linear  Programming,”  Management  Science,  2, 138-151  (1955). 

[6]  Frances,  R.  L.  and  J.  M.  Goldstein,  “Location  Theory:  A Selective  Bibliography,”  Operations 

Research,  ff,  400-410  (1974). 

[7]  Glover,  F.,  D.  Kamey,  and  D.  Klingman,  “The  Augmented  Predecessor  Index  Method  for 

Locating  Stepping  Paths  and  Assigning  Dual  Prices  in  Distribution  Problems,”  Transpor- 
tation Science,  6, 171-180  (1972). 

[81  Glover,  F.,  D.  Kamey,  D.  Klingman,  and  A.  Napier,  “A  Computational  Study  on  Start 
Procedures,  Basis  Change  Criteria,  and  Solution  Algorithms  for  Transportation  Problems,” 
Management  Science,  20,  793-814  (1974). 

[9]  Glover,  F.,  D.  Klingman,  and  A.  Napier,  “An  Efficient  Dual  Approach  to  Network  Problems” 
Opsearch  P(l),  1-19  (1972). 

[10]  Lee,  S.  M.,  Qoal  Programming  jor  Decision  Analysis,  (Auerbach,  Philadelphia,  1972). 

[11]  Morris,  J.  G.,  “A  Linear  Programming  Solution  to  the  Generalized  Rectangular  Distance 

Weber  Problem,”  Naval  Research  Logistics  Quarterly,  22,  155-164  (1975). 

[12]  Rabinowitz,  P.,  “Applications  of  Linear  Programming  to  Numerical  Analysis,”  SIAM  Review, 

10,  121-159  (1968). 

[13]  Spyropoulos,  K.,  E.  Kiountouzis,  and  A.  Young,  “Discrete  Approximation  in  the  Li  Norm,” 

The  Computer  Journal,  16,  180-186  (1973). 

[14]  Wagner,  H.  M.,  “Linear  Programming  Techniques  for  Regression  Analysis,”  Journal  of  the 

American  Statistical  Association,  B/^,  206-212  (1959). 

[15]  Wesolowsky,  G.  O.  and  R.  F.  Love,  “The  Optimal  Location  of  New  Facilities  Using  Rec- 

tangular Distances,”  Operations  Research,  19,  124-130  (1971). 


MONOTONE  FAILURE  RATES  FOR  MULTIVARIATE  DISTRIBUTIONS* 


Henry  W.  Block 

Univerrity  of  PitUburgh 
PtiUburgk,  Penruylvania 


ABSTRACT 


It  is  shown  that  the  monotone  multivariate  failure  rates  of  Brindley  and 
Thompson  have  no  natural  analog  involving  the  multivariate  failure  rate  function 
of  Basu  for  absolutely  continuous  distributions.  Quantities  related  to  the  multi- 
variate failure  rate  fimction  are  used  to  define  monotone  failure  rates.  It  is  shown 
that  these  are  equivalent  to  the  monotone  failure  rates  of  Brindley  and  Thompson. 
Based  on  these  quantities,  the  loss  of  memory  property  of  Marshall  and  Olkin  is 
characterised. 


1.  INTRODUCTION 

For  multivariate  distributions,  many  different  concepts  of  monotone  failure  rate  can  be 
defined.  Several  concepts  of  monotone  failure  rates  were  introduced  by  Brindley  and  Thompson  [5]. 
For  absolutely  continuous  bivariate  random  variables,  Basu  [2]  introduced  the  multivariate  failure 
rate  function.  As  in  the  univariate  case,  it  seems  reasonable  to  attempt  to  define  increasing  and 
decreasing  failure  rates  in  terms  of  this  multivariate  failure  rate  function.  In  Section  3,  we  attempt 
to  show  that  if  this  is  done  using  this  function,  the  resulting  concepts  do  not  parallel  the  monotone 
failure  rates  of  Brindley  and  Thompson. 

In  Section  4,  we  utilize  functions  related  to  the  multivariate  failure  rate  fimction  of  Basu  in 
order  to  define  monotone  failure  rates.  These  functions  turn  out  to  be  the  components  of  a hazard 
gradient  introduced  independently  by  Johnson  and  Kotz  [7]  and  Marshall  [8].  The  uses  of  these 
functions  by  these  authors  and  by  the  present  author  are  quite  different.  Coimections  will  be 
discussed  in  a subsequent  paper.  It  is  shown  that  these  monotone  failure  and  hazard  rates  corre- 
spond to  the  monotone  failure  rates  of  Brindley  and  Thompson.  Some  consequences  are  given. 
Based  on  the  concepts  of  Section  4,  a characterization  of  the  loss  of  memory  property  (LMP)  of 
Marshall  and  Olkin  [9]  is  given  in  Section  5.  A concept  of  increasing  hazard  rate  due  to  Harris  [6] 
which  is  less  important  than  the  concept  of  Brindley  and  Thompson  is  discussed  in  the  appendix. 


•Partially  supported  by  the  Office  of  Naval  Research  imder  Contract  N00014-75-C-0781  with  the  University 
of  California  and  Contract  N00014-7fi'C-0839  with  the  University  of  Pittsburgh. 
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In  discussing  monotone  failure  rates,  the  terms  “increasing”  and  “decreasing”  will  mean, 
respectively,  nondecreasing  and  nonincreasing.  Since  we  mainly  have  in  mind  survival  times,  when 

we  discuss  n random  variables  Xi AT,,  we  shall  assume  that  . . .,  .Y,>0)  = 1 and 

that  the  arguments  of  all  distribution  and  density  functions  are  positive.  An  abuse  of  terminology 
which  we  use  is  calling  F(x,,  . . .,  x^=P{Xi'^Xi a distribution  function  (d.f.). 

In  the  univariate  case,  the  failure  rate  function 


for  absolutely  continuous  distributions  is  increasing  (decreasing)  if  and  only  if 

P(r+A) 


P(a;-U)-P(r)_^ 

F(x) 


7(x) 


is  increasing  (decreasing)  in  x for  each  A,  i.e.,  F(jt+A)/P(x)  is  decreasing  (increasing)  in  x for 
each  A.  In  either  case,  we  say  that  the  distribution  is  IFR  (DFR). 


This  second  concept  has  been  generalized  by  Harris  [6]  and  by  Brindley  and  Thompson  [5], 
and  both  of  these  generalizations  involve  the  assumption  that 


(2.1) 


^(xrf  A x,+A) 

Fixi x.) 


is  decreasing  (increasing)  in  Xi,  . . .,  x,  for  each  A.  We  give  the  definition  of  Brindley  and  Thompson 
below.  The  concept  of  Harris,  which  is  of  less  interest,  will  be  discussed  in  the  appendix. 

DEFINITION:  A multivariate  d.f.  P(xi,  . . .,  x,)  for  the  random  variables  Xi,  . . .,  X,  has 
increasing  failure  rate  (IFR)  if  (•)  below  is  satisfied,  and  has  joint  IFR  if,  in  addition,  (*)  is  satisfied 
for  all  subfamilies  of  the  n random  variables.  This  latter  concept  is  designated  MIFR  by  Barlow 
and  Proschan  [1]. 


(•)P(X,>xt+A X,>x,+AlXi>z,, . . .,  X,>x.)='^^^.^^’  ' ' ’’  is  decreaMng  in  Xi,  . . ., 

F{Xi, . . .,  X,) 


X,  for  each  A. 


Decreasing  failure  rate  (DFR)  and  joint  DFR  are  defined  by  interchanging  the  word  increasing 
for  decreasing  in  (*). 

For  absolutely  continuous  random  variables,  the  failure  rate  function  has  been  generalized 
to  the  multivariate  case  by  Basu  [2]  and  is  given  in  the  following  definition. 

DEFINITION:  Let  F(xi,  . . .,  x»)  have  an  absolutely  continuous  distribution  with  density 
/(*! Xn).  Then 


(2.2) 


r(x, • • •’ 

Fix, X.) 


is  called  the  multivariate  failure  rate  function. 
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To  our  knowledge,  the  concept  of  monotone  failure  rate  for  this  function  has  not  been  investigated. 
A concept  related  to  the  monotone  failure  rates  above  is  the  loss  of  memory  property  (LMP) 
of  Marshall  and  Olkin  [9].  A distribution  is  said  to  have  the  LMP  if 

P(X,>x,+A X,>x,+A\X,>x, .y.>x.)=P(A’,>A A',>A) 

or,  equivalently, 

(2-3)  7(xi+A x,+A)=T(x,,  . . .,  x,)P(A,  . . .,  A) 

for  all  Zi,  . . .,  z,  and  A.  This  property  is  possessed  by  the  multivariate  exponential  distribution 
(MVE)  of  Marshall  and  Olkin  [9],  the  bivariate  exponential  extension  of  Block  and  Basu  [4],  and 
the  multivariate  exponential  extension  of  Block  [3].  It  characterizes  these  distributions  when  the 
marginals  are  specified. 


3.  THE  BIVARIATE  FAILURE  RATE  FUNCTION 


As  mentioned  in  the  previous  section,  we  are  aware  of  no  study  of  monotone  failure  rate  for 
the  multivariate  failure  rate  function  (2.2).  One  possible  reason  for  this  is  that  there  do  not  appear 
to  be  close  analogies  between  monotone  failure  rates  for  this  function  and  the  monotone  failure 
rate  concepts  of  Brindley  and  Thompson  [5].  In  the  present  section,  we  demonstrate  this  lack  of 
close  relationship  in  the  bivariate  situation. 

In  attempting  to  define  monotone  failure  rates  using  the  bivariate  failure  rate  function,  two 
possibilities  seem  most  natural.  The  first  of  these  represents  a straightforward  mathematical 
generalization  from  the  univariate  case. 

CONDITION  3.1: 


r(z,,  xt) 


/(Xi,  Xj) 

F{xi,  xj) 


is  increasing  (decreasing)  in  Xi  and  z>. 

This  condition,  however,  appears  too  strong,  since  the  boundary  between  the  increasing  and 
the  decreasing  cases  is  thaty(xi,  x^lT{xi,  xj)  is  constant.  In  the  case  of  exponential  marginals, 
Basu  [2]  showed  that  this  implies  that  the  bivariate  distribution  htis  independent  marginals.  Puri 
[10]  showed  that  this  was  also  true  under  the  apparently  weaker  assumption  that  at  least  one  of 
the  marginals  is  exponential.  If  the  marginals  are  unspecified,  Puri  and  Rubin  [11]  showed  that 
the  joint  distribution  must  be  a mixture  of  independent  exponentials;  therefore,  the  mar^nals 
are  mixtures  of  univariate  exponentials.  These  in  general  have  a decreasing  failure  rate,  which 
is  not  very  satisfactory  for  a boundary  distribution. 

A second  condition,  implied  by  the  first  condition,  is  based  on  the  observation  that  for  the 
bivariate  exponential  extension  of  Block  and  Basu  [4],  the  bivariate  failure  rate  function  r(z,+A, 
xj+A)  is  constant  in  A for  each  Xi  and  Xj.  As  shown  in  Section  5,  this  property  is  shared  by  any 
absolutely  continuous  distribution  having  the  LMP. 

CONDITION  3.2; 

r(xi+A,  Xj+A) 


is  increasing  (decreasing)  in  A for  all  X|  and  Xj. 
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The  following  examples  will  show  that  these  concepts  are  not  closely  related  to  the  mono- 
tone failure  rates  discussed  in  the  previous  section,  although  the  second  concept  is  related  to  the 
LMP,  as  will  be  shown  in  Theorem  5.2. 

Condition  3.1  implies  neituer  the  IFR  (DFR^  nor  the  joint  IFR  (DFR).  This  is  shown  in 
Fxample  1.  This  example  also  shows  that  Condition  3.2  implies  neither  of  these  properties,  since 
it  is  weaker  than  the  first  property.  The  second  example  shows  that  neither  the  IFR  (DFR)  nor 
the  joint  IFR  (DFR)  implies  the  increasing  version  of  Condition  3.1. 


EXAMPLE  1:  Let 


F(xu  xt)=p  exp  (-(X,x,-|-X.X,-‘a;a))-h(l-p)  exp  (-(Xjx.+X-Xj-'  x,)) 


where  0<p<l  and  X,  Xj>0.  Then 


/(Xi,  Xa)  _ 
^(Xi,  Xj) 


so  Conditions  3.1  and  3.2  are  clearly  satisfied.  However,  it  can  be  shown  that,  for  certain  choices 
of  Xi,  X],  and  X, 


Xj+t) 
F(x„  X,) 


is  strictly  decreasing  in  x,  and  strictly  increasing  in  xj,  so  the  distribution  is  neither  IFR  nor  DFR. 
EXAMPLE  2:'Ut 

F{xx,  exp  (— XjXi— XaX2— Xia  max  (xi,  Xa))— exp  (—X  max  (xj,  Xa)) 


where  Xi,  Xa,  Xia)]>0  and  X=Xi-f-Xa-|-Xia, 


which  is  the  bivariate  exponential  extension  of  Block  and  Basu  [4].  This  distribution  has  the  LMP 
which  implies  that  it  is  IFR,  and  it  can  be  shown  that  the  marginals  are  IFR.  Thus,  it  is  joint 
MIFR.  However, 


r(xi,  Xa)=.^ 


Xi  (Xa-FXia)  X .,  . 

X-X.a  exp  (-X,(xa-x,)) 

(Xi-FXia)  XaX ., 

X-X,a  exp  (-Xa(x,-xa)) 


which  is  strictly  increasing  in  Xi  for  Xi<xa  but  strictly  decreasing  in  Xi  for  Xi>xa,  so  that  the  first 
condition  is  not  satisfied. 

Since  the  bivariate  failure  rate  function  does  not  seem  to  be  closely  related  to  the  monotone 
failure  rates  discussed  previously,  in  the  next  section  we  introduce  functions  which  lead  to  failure 
rates  which  parallel  those  formerly  discussed. 


4.  ALTERNATE  MULTIVARIATE  FAILURE  RATES 


Since  the  bivariate  failure  rate  function 
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where 

does  not  appear  to  be  compatible  with  the  monotone  failure  rates  of  Brindley  and  Thompson 
[5],  we  consider  the  functions 

Xj)  ^l(^l)  ^i) 

7(x,,  Xt)  F(xi,  Zj) 

in  the  bivariate  case,  or  more  generally 


Xn) 

F{Xx,  . . Xn) 


for  t=l,2, 


.,n 


in  the  multivariate  case.  The  negatives  of  these  functions  are  the  components  of  the  hazard  gradient 
mentioned  in  the  introduction,  and  give  the  lith  one-dimensional  failure  rate  functions  upon  setting 
Zj=0  for  j 

DEFINITION : The  d.f.  F(xi z,)  for  the  random  variables  Xi,  . . .,  Xn  is  IFRn  (DFR,) 

if  (**)  below  is  satisfied,  and  is  joint  IFR^  (joint  DFRa)  if,  in  addition,  (**)  is  satisfied  for  all  sub- 
families of  the  n random  variables.  It  is  assumed  in  the  following  that  the  first  partial  derivatives 
exist. 

(**)  For  each  i=l,2,  . . .,  m. 


F,,(xi+A,  ■ . .,  z,-bA) 
Ffzi-fA x»+A) 


^ F(zi-fA,  . . .,  z,-t-A) 
F(xi+A x,+A) 


is  decreasing  (increasing)  in  A for  each  Zi,  . . .,  z,. 

The  following  properties  hold  for  random  variables  Xi,  . . Xn  with  d.f.  F(zi, . . .,  z,)  which 
are  joint  IFRa  (joint  DFRn) : 

(1)  A single  random  variable  which  is  joint  IFRa  (joint  DFR,)  is  IFR  (DFR)  in  the 
univariate  sense. 

(2)  The  imion  of  two  mutually  independent  sets  of  joint  IFR,  (joint  DFRa)  random 
variables  is  joint  IFRa  (joint  DFRa). 

(3)  Any  subset  of  joint  IFRa  (joint  DFi?,)  random  variables  is  joint  IFRa  (joint  DFRa). 

(4)  Sets  of  minimums  of  joint  IFRa  (joint  DFRa)  random  variables  are  joint  IFRa 
(joint  DFRa). 

The  proofs  of  (1)  and  (3)  are  by  definition  and  (2)  is  easy  to  show. 

PROOF  OF  (4): 

Let  Yi,  . . .,  Yn,  be  joint  IFRa  (joint  DFRa)  and  Jlf<=min  Yj  for  i=l,2,  . . .,  n where  J, 
c{l,  2,  . . .,  m}.  Let 


I,= 


Then 
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OXft 

[ n {x<>x<-i-A} 

l<-i 

^p| 

Oxk  1 

t-l 

^P 

dx* 

1 n {r>>yy+A}n  n 

" j,K, 

{Fy>X»-l-A}j 

p(n  {X,>x,+A} 

<-l  J 

p 

[ {Fy>yy+A}j 

P 

+ 

A 

S U.  <>^<>■'-+‘‘1  n A {r,>»,+^)  n !r,>a+A|}] 

^*{njr,>y,+A}j 


where  the  quantity  in  brackets  is  evaluated  at  yy=ar*  forjeK*  and  each  of  the  summands  is  decreas- 
ing (increasing)  in  A,  since  F, F,  are  joint  IFR,  (joint  DFR^).  This  implies  that  Xu  . . 

Xn  are  joint  IFR,  (joint  DFRa). 

It  is  not  surprising  that  random  variables  which  are  joint  IFR,  (joint  DFR,)  satisfy  the  pre- 
ceding four  conditions  (see  the  six  conditions  of  Section  2 of  Brindley  and  Thompson  [6])  in  the  light 
of  the  following  proposition. 

THEOREM  4.1:  Subject  to  the  existence  of  first  partial  derivatives  of  the  d.J.  7,  T is  IFR, 
(DFRa)  if  and  only  if  IFR  (DFR). 

PROOF:  Assume  FialFR,  (DFR,).  Then  for  each  t=l, . . .,  n, 


Ft,(xi+& x,-bA) 

F(xi-|-A,  . . .,  x,-|-A) 


is  decreasing  (increasing)  in  A for  fixed  xi, . . .,  x,.  Thus,  for  each  i=l n and  A>0, 

Ftiixi, . . X,)  ^ yjt<(xi-hA x«-bA) 

t 7’(xt,  . . .,  X,)  F(xi-|-A, . . .,  x,-|-A) 


It  then  foUows  for  each  t=l, . . .,  n and  A>0  that 

0>(<)  y(g|>  • • x.)7a(xi+A x,+A)— /^<(xi, . . .,  x,)/(xi-bA x,4-A) 

(F(Xr X.))* 

_ d F(xi-hA,  . . .,  x«-hA) 

^(*1 Xn) 

Thus, 

F(zi+A,  . . .,  x«-|-A) 

F(xu  • • ■!  x«) 


is  decreasing  (increasing)  in  xi,  . . .,  x,  for  each  A,  and  thus  is  IFR  (DFR).  Reversing  the  argument 
gives  the  proof  in  the  opposite  direction. 

COROLLARY:  Under  the  assumptions  of  Theorem  4.1  on  ¥,  F'w  joint  IFR,  (joint  DFR^  if 
and  only  if  /'is  joint  IFR  (joint  DFR). 


5.  A CHARACTERIZATION  OF  THE  LOSS  OF  MEMORY  PROPERTY 


We  now  turn  to  the  LMP  and  obtain  a characterization  which  is  related  to  the  preceding 
concepts.  Upon  differentiating  (2.3)  repeatedly  and  then  substituting  (2.3)  into  the  resulting 
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equation,  we  obtain,  for  each  positive  integer  m and  positive  integers  k\, . . .,  jr.  such  that  ki+kt+ 
. . . +kn=m 

Sr  — Sr  — 

• SxT'-  

F(zi+A,  . . .,  z,+A)  F(zi,  . . .,  X,) 


which  we  can  express  by  saying  that 


dxi 


/■dx,** . . . dx.»-  


F(xi, . . .,  X.) 


is  stationary  in  X|, . . x.  or  that 


8xi*‘dx,*»  • dx*'  

F(Xi+A,  . . .,  x,+A) 


is  constant  in  A for  all  Xi,  . . .,  x,.  In  particular,  if  F{xi,  . . .,  x.)  is  absolutely  continuous  with 
density  /(xi,  . . .,  x,),  then  the  multivariate  failure  rate  function  r(xi,  . . .,  x.)  is  stationary 
in  Xi,  . . .,  x«,  and  also,  for  i=l,2,  . . .,  n, 



TiXi X,) 

is  stationary  in  Xi,  . . .,  x,. 

It  is  natural  to  a^  if  any  of  these  conditions  characterize  the  IMP.  We  give  an  affirmative 
answer  in  the  following  theorem. 

THEOREM  5.1:  Under  the  assumption  that  the  first  partial  derivatives  of  F exist,  Fhas  the 
IMP  if  and  only  if 

4;  

F(x„  . . .,  X.) 


is  stationary  in  Xi,  . . .,  x,  for  each  i=l,2,  . . .,  n. 

PROOF : If  Y has  the  IMP,  then  one  direction  of  the  proof  is  obvious  from  the  remarks 
preceding  the  theorem.  Now  assuming  the  stationary  conditions,  for  all  i=l,2,  . . .,  n, 

?(x,+A x,+A)  T(xi, . . .,  X,) 

F(xi4-A,  . . .,  x,+A)  F(xi, . . .,  X,) 


The  left-hand  side  of  the  above  equation  is  equal  to 

* F(A,  . 


and  the  right-hand  side  is 


I;  !<«?<«„ 


. .,  x,+A) 
...A) 

. X,) 


y(xi-hA,  . . .,  x,-|-A)  Q 
F{xi, . . .,  x,)7(A A) 


so  that 
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Since  this  is  true  for  all  2,  . . n,  it  follows  that 

F(zi+A,  . . z,+A) 


(5.1) 


A) 


is  a fimction  only  of  A and  constant  in  Zi,  . . z..  This  gives  that  (5.1)  is  equal  to 

F(A A) 


F(0,  ...,0)7(A A) 


=0- 


This  gives  the  LMP,  and  the  result  is  proven. 

To  see  that  this  characterization  is  related  to  the  previous  material,  notice  that  for  i=l,  2, . . .,n, 

7(g|,  • • •.  Xn) 

F{xi z.) 

is  stationary  in  zi, . . .,  z,  if  and  only  if 

• • •’ 

F(zi+A,  . . .,  z,+A) 

is  constant  in  A for  each  Z],  . . z..  This  last  condition  is  then  seen  to  be  equivalent  to  F being 

both  IFRa  and  DFR^.  This  observation  also  leads  to  an  alternate  proof  of  the  preceding  theorem 
using  oim  Theorem  4.1  and  the  results  of  Brindley  and  Thompson  [5]. 

In  the  biviarate  case,  a related  result  is  of  some  interest.  Its  proof  is  similar  to  that  of  some 
of  the  preceding  results  and  is  omitted. 

THEOREM  5.2:  Subject  to  the  existence  of  the  appropriate  partial  derivatives,  any  two  of 

FxAxi,  zi)^  ^(zi,  xs)^  Fnx,  (xt,  xt) 

7(z,,  z,)  ’ E(zi,  zi)  ’ F{xi,  Xt) 

are  stationary  if  and  only  if  F has  the  LMP. 


APPENDIX 

Harris  [6]  was  one  of  the  first  to  introduce  a multivariate  increasing  failure  rate  concept.  He 
assumed  (*)  of  Section  2 and,  in  addition,  a type  of  dependence  called  right  comer  set  increasing. 
This  positive  dependence  concept  is  one  of  many  discussed  in  Chapter  5,  Section  4 of  Barlow  and 
Proschan  [1).  Assuming  any  of  these  in  conjunction  with  (*)  would  give  a different  monotone  fmlure 
rate  concept.  Since  this  concept  depends  on  the  specific  type  of  dependence  assumed,  it  appears 
to  be  less  important  than  the  concepts  of  Brindley  and  Thompson  discussed  in  the  main  body  of 
the  present  paper. 

For  completeness,  we  discuss  here  the  failure  rate  concept  of  Harris  in  the  context  of  the 
present  paper.  Harris  called  his  concept  an  increasing  hazard  rate. 

DEFINITION  (Harris);  A multivariate  d.f.  F{xi,  . . .,  z,)  for  the  random  variables  X\, 
. . .,  Xn  is  said  to  have  increasing  hazard  rate  (IHR)  if  it  is  IFR  in  the  sense  of  the  definition  of 
Section  2 and,  in  addition,  F(xi,  . . .,  z.)  is  right  comer  set  increasing  (RCSI) ; i.e.. 


MULTIVARIATE  MONOTONE  FAILURE  RATES 


635 


P{X,>x[ X^>x.] 

is  increasing  in  {xj|x(>X(}  for  each  choice  of  x[,  . . x'». 

Although  the  original  definition  of  RCSI  given  by  Harris  [6]  is  different  than  that  given 
above,  it  is  equivalent  to  it  by  Theorem  3.2  of  Brindley  and  Thompson  [5]  (modulo  certain  typo- 
graphical omissions).  The  above  form  is  preferred  since  it  is  then  possible  to  define  a decreasing 
version  of  the  concept  by  replacing  the  word  increasing  by  decreasing.  This  was  done  by  Brindley 
and  Thompson,  and  ¥ is  said  to  have  DHR  if  it  is  DFR  and  right  comer  set  decreasing  (RCSD), 
i.e.  the  quantity  above  is  decreasing  in  {x<|x<>x5}  for  each  choice  of  x,',  . . .,  xi. 

It  is  easy  to  see  that  a bivariate  exponential  distribution  of  Gumbel  (see  Ref.  [5],  p.  823)  is 
not  IHR  but  does  satisfy  Condition  3.1  of  Section  3,  so  that  this  concept  also  is  not  related  to  the 
monotonicity  concepts  discussed  in  Section  3. 

We  adopt  the  following  notation  for  the  vectors  x=(xi,  . . .,  x,)  and  y=(yi,  . . .,  y%): 

(1)  x>y  means  Xj>2/<  for  i=l,  . . .,  n. 

(2)  /(x)  is  increasing  in  x means  / is  increasing  in  each  component  of  x. 

(3)  For  c {1,  2,  . . .,  n},  K—{\,  2,  . . .,  n}— iiiand  XK={Xi\itK},  where  subscripts 
are  placed  in  ascending  order. 

As  in  Section  4,  the  following  alternate  hazard  rate  concept  can  be  introduced. 

DEFINITION:  The  d.f.  F(xi,  . . .,  x,)  for  the  random  variables  Xi X,  is  IHR, 

(DHR,)  if  it  is  IFR,  (DFR,)  and,  in  addition,  it  is  RCSI,  (RCSD,) ; i.e.  for  all  iiC  c:  {1,2,.  . .,  n} 
and  each  i tK, 

Fu(xk,xt,) 

F(xk,  Xk)  F{xk,  x») 

is  increasing  (decreasing)  in  xn  for  all  xg. 

THEOREM:  A.l : Let  F(x. x.)  be  IHR,  (DHR,).  Then  for  each  tsK, 

Fx,(xk+A,  xf) 

F(xic+^,  xi) 

is  decreasing  (increasing)  in  A for  all  xr- 

PROOF:  For  each  i t K and  for  each  A>0, 

^(xjc-HA,  xg)  ^ .^^.^,(Zic-|-A,  xjf -l-A)  ^ ,.^.F,,(xk,  xr) 

F(xK  + ^,xr)~  ~ F(xjc-|-A,  xir  + A)  “ “ 7'(x*,  a:jr)’ 

where  the  first  inequality  follows  by  RCSI,  (RCSD,)  and  the  second  by  IFR,  (DFR,).  This  gives 
the  result. 

Properties  (l)-(4)  of  Section  4 with  IHR,  (DHR,)  replacing  joint  IFR,  (joint  DFR,)  also 
hold.  In  this  case  (3)  requires  proof,  but  this  follows  by  taking  K in  the  above  theorem  to  bo  the 
set  of  the  indices  of  the  requisite  subset  and  taking  xk=0.  The  proof  of  (4)  is  similar  to  the  proof 
of  (4)  in  Section  4. 

The  following  result  is  a corollary  to  Theorem  4.1. 


L 
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COROLLARY:  Let  T{x, be  IHR  (DHR).  Then 

y^x+A,  xg) 

F(xx,  xg) 

is  decreasing  (increasing)  in  x*  for  all  x^  where  Kc.  { 1,  2,  . . n}. 

PROOF : We  prove  this  only  in  the  case  that  the  first  partial  derivatiyes  of  F are  assumed  to 
exist.  The  proof  in  the  general  case  is  similar  to  an  argument  in  Harris  [6].  First,  F is  IHR,  (DHRa) 
by  the  previous  theorem.  Then,  by  Theorem  4.1,  we  have  that 

yT((a:x+A,  xg) 

F(xx+A,  Xk) 

is  decreasing  (increasing)  in  A for  each  i t K and  for  all  xg.  Then,  as  in  the  proof  of  Theorem  4.1, 

9 F(Xx+A,  Xk)  . 


F(Xx,  Xk) 


'<(>)0 


and  so  the  result  holds. 

THEOREM  A.2:  Subject  to  the  existence  of  the  first  partial  derivatives  ofthed.i.F.FiaRCSI, 
(RCSDa)  if  and  only  if  F is  RCSI  (RCSD). 

PROOF:  Let  FT  <={1, 2,  . . .,  n}.  Then  ^is  RCSI,  (RCSDa)  means  that 

T'zAxk,  Xk) 

"FiXK,  Xk) 

is  increasing  (decreasing)  in  xx  for  all  t c K.  Then  for  all  t € JY  and  xjE>x]f,  we  have 

^Zi(XKi  xg)  ^ ^ \ F r,(Xx,  Xk) 

7(Xk,Xk)  7'x,(Xk,  Xk) 

This  implies  that  for  all  itK  and  for  x;  > xj, 


and  so  for  xx>xx, 


o<(>)  a Tiwi), 

-'-'^^iT(XK,Xg) 

F(Xk,  Xk) 

F(Xk,  Xg) 


is  increasing  (decreasing)  in  x^.  This  gives  that 

P(Y>x'|Y>x) 


is  increasing  in  {x<|x«>x|}  and  is  RCSI  (RCSD).  The  argument  is  reversible  and  so  the  result 
is  proven. 

COROLLARY:  Under  the  assumptions  of  Theorem  A.2  on  F\  Fia  IHRa  (DHR,),  if  and  only 
if  y is  IHR  (DHR). 

PROOF:  Apply  Theorems  4.1  and  A.2. 
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ABSTRACT 


The  paper  describes  an  approach  to  the  evaluation  of  the  effectiveness  of  a 
minefield  in  terms  of  the  number  of  mines  that  are  detonated  by  a convoy  of  sweepers 
and  ships  and  the  corresponding  number  of  vessels  that  are  immobilized.  The  posi- 
tions of  the  mines  and  ^e  tracks  of  the  vessels  are  assumed  to  be  known,  which 
means  that  the  evaluation  measures  are  dependent  on  a large  number  of  disjoint 
events,  each  event  being  the  immobilisation  of  particular  vessels  by  particular 
mines.  This  may  render  combinatorial  methods  computationally  infeasible,  but 
by  introducing  approximations  in  the  assumptions,  the  difficulty  can  be  overcome, 
specifically  by  modelling  the  arrival  of  each  individual  vessel  in  the  neighborhood 
of  a mine  by  an  inhomogeneous  Poisson  stream  for  which  the  arrival  rate  is  non- 
sero  only  over  a short  time  interval.  The  plausibility  of  the  approach  is  supported 
by  results  of  a critical-event  simulation  model. 


INTRODUCTION 

The  discussion  given  in  this  paper  concerns  the  characteristics  of  minefields  laid  in  the  path 
of  transiting  ships.  Minesweepers  pass  to  and  fro  across  the  fields  several  times  in  an  attempt  to 
dear  a channel  prior  to  the  arrival  of  ships.  The  term  “vessel”  is  used  to  refer  to  a ship  or  a sweeper. 
The  mines  are  fitted  with  a coimt  device  such  that  a fixed  number  of  actuations  caused  by  passing 
vessels  has  to  take  place  before  a mine  detonates.  The  situation  is  analysed  in  terms  of  the  number 
of  mines  that  detonate  and  the  losses  inflicted  on  the  vessels. 

Several  authors  have  considered  this  problem  ; the  papers  of  Conolly  [1]  and  Hill  [2]  may  be 
mentioned,  but  may  not  be  readily  available  to  most  readers.  The  present  paper  extends  previous 
work  by  the  explicit  introduction  of: 

1.  the  geographical  positions  of  the  mines, 

2.  the  tracks  and  relative  positions  of  the  vessels  in  the  convoy,  and 

3.  the  following  considerations  that  were  found  necessary  for  practical  applications: 

(1)  a mine  usually  has  an  intercount  dormant  period  (the  minimum  time  interval 
between  successive  actuations  of  the  mine) ; 
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(2)  a single  sweeper  may  actuate  a mine  several  times; 

(3)  vessels  may  make  multiple  crossings  of  the  field; 

(4)  vessels  are  subjected  to  navigation  errors; 

(5)  the  reliability  of  a mine  could  be  uncertain ; 

(6)  mines  frequently  have  arming  delays  (initial  period  of  dormancy). 

Factors  1 and  2 imply  that  the  probability  of  a mine  being  actuated  by  a passing  vessel  and 
the  conditional  probability  of  immobilization  given  detonation  can  be  specified  in  terms  of  the 
relative  positions  of  vessel  and  mine.  These  probabilities  of  actuation  and  immobilization  may 
be  any  function  of  the  distance  between  the  nune  and  the  track  of  the  vessel.  In  Refs.  [1]  and  [2], 
an  average  actuation  probability  is  used  as  input  to  the  basic  mathematical  equations  describing 
the  losses  of  vessels  and  mines.  The  spatial  averaging  with  respect  to  the  positions  of  the  mines 
and  the  tracks  of  the  vessels  is  performed  before  the  losses  are  calculated.  The  result  is  not  identical 
to  that  obtained  by  first  calculating  the  losses  for  given  mine  positions  and  vessel  tracks  and  then 
taking  the  spatial  average  value,  which  is  a more  correct  procedure.  Furthermore,  the  introduction 
of  an  average  actuation  probability  for  each  mine  type  may  be  of  little  use  if  there  are  only  a few 
mines  of  each  type  near  the  channel  cleared  by  the  minesweepers.  Another  difficulty  is  revealed  by 
considering  a convoy  with  a very  large  number  of  vessels  crossing  an  extensive  minefield  containing 
mines  that  have  zero  and  nonzero  actuation  probabilities  with  respect  to  the  vessels  in  the  convoy. 
The  use  of  an  average  actuation  probability  for  each  type  of  mine  and  vessel,  on  the  assumption 
that  the  positions  of  the  mines  are  uniformly  distributed  in  the  field,  implies  that  all  mines  of  each 
type  have  the  same  nonzero  probability  of  being  actuated  by  a vessel.  A very  large  convoy  could 
thus  theoretically  cause  all  mines  to  detonate,  whereas  in  reality  only  the  mines  in  a narrow  channel 
would  be  detonated.  An  obvious  way  to  circumvent  this  difficulty  is  to  discard  all  the  mines  outside 
the  actuation  ranges  of  the  vessels,  but  because  actuation  ranges  vary  from  mine  to  mine  and  from 
vessel  to  vessel,  there  is  difficulty  in  deciding  which  mines  should  be  discarded  and  which  should 
not.  A mine  that  has  an  actuation  range  much  greater  than  normal  may  prove  to  be  deadly  for  one 
of  the  important  or  valuable  ships  in  the  convoy;  on  the  other  hand,  choosing  too  wide  a channel 
results  in  too  many  losses.  The  introduction  of  the  inputs  1 and  2 resolves  these  difficulties. 

Extensive  descriptions  of  the  model  and  the  corresponding  computer  program  are  given  in 
Ref.  [4].  Numerical  comparisons  based  on  a detailed  and  much  more  aggregated  critical-event 
simulation  model  support  the  theory.  Consequently,  the  analytical  model’s  advantage  of  being 
much  less  time-consuming  recommends  its  use  in  place  of  simulation  models. 

In  the  present  paper,  the  description  of  the  situation  to  be  analysed  and  of  the  associated 
mathematical  parameters  is  followed  by  a model  for  the  losses  of  sweepers,  ships,  and  mines.  The 
verification  of  the  model  is  then  described,  and  several  extensions  are  introduced.  Finally,  a plausible 
numerical  example  with  fictitious  data  is  given  for  illustration  purposes. 

THE  SITUATION 

For  model  design,  it  is  assumed  that  merchant  ships  and  combat  ships  cross  an  ocean  region 
at  the  same  speed  and  in  the  same  direction.  The  paths  of  the  ships  cross  a minefield,  where  the 
ships  become  vulnerable  to  damage  as  a consequence  of  mine  detonations.  Minesweepers  pass  to 
and  fro  across  the  minefields  several  times  in  an  attempt  to  detonate  as  many  mines  as  possible, 
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thereby  clearing  a channel  through  the  fields  prior  to  the  arrival  of  the  ships.  The  arriving  ships 
then  cross  the  field  along  this  channel,  possibly  preceded  by  minesweepers. 

A vessel  (ship  or  sweeper)  has  prescribed  probabilities  of  actuating  mines  that  are  located  sufiS- 
ciently  close  to  its  track.  When  a mine  is  actuated,  its  “count”  is  raised  by  one.  When  the  count 
has  increased  by  the  number  of  times  fixed  by  the  count  setting,  the  mine  detonates  and  has  a 
prescribed  probability  of  destroying  the  actuating  vessel,  depending  on  its  distance  from  the  vessel 
at  the  instant  of  detonation.  Various  levels  of  damage  can  be  defined,  but  the  one  used  in  this 
paper  is  the  level  which  causes  immobilization  of  the  vessel. 

The  specific  problem  addressed  is  quantification  of  the  total  numbers  of  mines  detonated  and 
vessels  destroyed. 

PARAMETERS 

Vessels  and  mines  are  identified  by  subscripts  i and  j,  respectively.  The  following  parameters 
are  used: 

I : the  total  number  of  vessels, 

J : the  total  number  of  mines, 
nij  : the  count  setting  of  mine  . .,  J, 

ajt  : the  conditional  probability  of  vessel  i actuating  mine  j,  given  that  the  vessel  approaches  the 
mine  and  that  the  mine  is  active,  j=l, . . .,  <7andi=l,.  . ., /, 

P/i  : the  probability  of  vessel  i being  immobilized,  given  that  it  detonates  mmej,j=l,  ...,</ 
and  i=l, 

Vessels  and  mines  are  numbered  from  1 to  / and  from  1 to  i7,  respectively,  in  the  sequence  of 
their  projections  on  a line  parallel  to  the  direction  of  motion  of  the  vessels,  and  such  that  vessel 
number  1 bypasses  mine  number  1 first.  When  a mine  is  actuated,  its  count  is  raised  by  one,  and 
after  ntj  actuations  the  mine  detonates.  No  restrictions  are  imposed  on  the  submodels  that  are  used 
to  obtain  the  actuation  and  damage  probabilities  aji  and  Pj,;  these  submodels  will  depend  on  the 
tracks  of  the  vessels  and  the  positions  of  the  mines. 

In  the  mathematical  derivations,  it  is  assumed  that  any  chosen  vessel  in  the  convoy  can  actuate 
a mine  only  once;  it  is  then  demonstrated  how  the  equations  can  be  modified  to  allow  for  multiple 
actuations  (for  example,  by  a sweeper). 

THE  MODEL 

A vessel  which  is  immobilized  by  a mine  can  obviously  not  actuate  any  subsequent  mine  in  the 
field.  Consequently,  for  each  mine  j and  vessel  i,  the  probability  that  vessel  i detonates  mine  j 
depends  on  whether  any  of  the  vessels  has  been  immobilized  by  one  of  the  mines 

{m,  1}.  The  implication  is  that  the  probability  that  a given  number  of  vessels 

are  immobilized  depends  on  a large  number  of  disjoint  events.  In  fact,  a combinatorial  model  that 
adds  up  the  probabilities  of  these  events  may  become  computationally  impractical  unless  approxi- 
mate evaluation  methods  can  be  introduced. 


A simplification  results  if  it  is  noted  that  the  probability  of  vessel  i being  immobilized  by  mine 

j,  given  which  vessels  out  of  1,  . . .,  (t-1)  were  destroyed  by  the  mines  1 0-I)>  c*® 

expressed  in  terms  of  the  bii^omial  probability  distribution,  if  the  actuation  and  destruction 
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probabilities  for  all  mines  vs  all  vessels  are  identical.  This  means  that  these  probabilities  are  de- 
scribed by  their  spatial  average  values.  However,  for  reasons  previously  given,  this  approach  is  not 
used  in  this  paper.  Furthermore,  when  the  number  of  vessels  immobilized  cannot  be  considered 
small,  the  basic  combinatorial  problem  described  above  in  determining  the  probability  distribution 
of  the  total  number  of  vessels  immobilized  is  imsolved. 

The  method  used  in  this  paper  is  based  on. the  introduction  of  approximations  in  the  model 
assumptions:  vessels  are  assumed  to  arrive  at  the  minefield  in  an  inhomogeneous  Poisson  stream 
with  rate  it(0)  for  $5^0.  Vessel  i is  associated  with  a time  interval  («<,  dj  such  that 

showing  that  vessel  i arrives  prior  to  vessel  Furthermore, 


and 


M(fl)=Ofor  9^  [o,,  6<]. 


Then  vessel  i is  described  by  the  rate  m(9)  over  the  time  interval  [a(,  6 J.  It  turns  out  that  the 
results  of  this  paper  do  not  depend  on  the  choice  of  the  values  of  a<  and  6«  if  the  above  requirements 
are  satisfield. 

At  this  point,  it  is  important  to  describe  how  the  vessels  actuate  the  mines  and  how  mine 
detonation  infiuences  the  progress  of  the  vessels  through  the  field.  To  this  ei.  J,  let  be  the 
rate  in  an  inhomogeneous  Poisson  stream  describing  the  arrival  of  vessel  i at  mine  j.  Then 


m(9). 

0, 


for  9c[a<, 
otherwise. 


The  Mif(9)  are  calculated  recursively  by  (4)  that  is  derived  below.  It  follows  that 
only  for  9e(o(,  6J.  Mine  j is  actuated  in  the  time  interval  [9,  9-i-A9]  with  probability  Mit(9)a>,A9, 
if  9((at,  df].  Actuations  in  different  time  intervals  d9  are  independent.  As  a check  on  the  assump- 
tions, note  that  the  probability  that  vessel  i actuates  mine  1 is  given  by 

. , ,,  y’MH(9)<*iid9  = o(u, 

as  it  should. 

If  P/»(9)  is  the  probability  that  mine  j receives  i actuations  before  time  9,  then  a set  of  dif- 
ferential-difference equations  for  these  probabilities  can  be  written,  with  the  solution 

(1)  E,(«y  exp  (-E/e)) 

where 

(2)  Sj  «yi  dr. 

Putting  i=my— 1 gives  the  probability  p^(9)  that  the  mine  detonates  if  it  is  actuated  once  more  at 
time  9: 


(3) 


I>/W=lV(«i-i)(9)- 
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Note  that  when  all  and  nniv)  for  i=l,  . . / are  equal,  (1)  reduced  to  the  well-known 

Poisson  approximation  of  the  binomial  distribution. 

Remember  that  /are  given.  Assuming  that  the  ^=1,  • ■ .,/ are  known 

for  a particular  value  ofj,  the  mh-i.((®)i  ^=1i  • • ■,  I are  obtained  as  follows: 


where 


Mj+i.  < (9)  — tin  (S)  (1  — Kfi  (0)) 


In  explanation  of  (4)  and  (5),  note  that  njt{e)Maf,pj(6)  is  the  probability  that  vessel  i detonates 
minej'  in  A9.  Then  the  vessel  is  destroyed  with  probability  Pn;  the  arrival  rate  is  modified  accordingly. 

The  total  losses  can  conveniently  be  expressed  in  terms  of  the  probabilities  that  each  ship  i 
detonates  mine  j.  Then 


If  we  write 


it  follows  that  for  dt[ai,  6<], 


Cji~  r nji{d)dd, 

Joi 

ajnC)n  + aji  r Hjt(y)dv, 

n-1  Ja< 


where  the  sum  on  the  right-hand  side  is  zero  for  i=l.  Using  (1),  (2),  (3),  and  (6),  we  obtain  the 
foUowing  expressions  for  the  detonation  probabilities: 

(7)  2>i=l — ^ ®xp  ( Ej^ 

(8)  (-i)*  ®*P  ( ®xp  ( Efi)]‘,  2<i<I, 


where 


(10)  E^=0. 

Integrating  (4)  and  using  (5)  and  (6)  gives 

(11)  Uy+l,  <=C'^< — Ptl9.ll- 

The  values  of  gy,  are  calculated  recursively;  when  gy<  is  known  for  i=l,  ...,/,  the  Cy+i. < are 
given  by  (11)  and  the  gy+i,  < by  (7)  and  (8),  the  recursion  starting  with  Cn=l  and  the  calculable  gu- 
The  probabilities  Xy<  that  each  vessel  i is  destroyed  by  mine  j are  given  by 

(12)  xy,=Py,gy<  j=l J and  t=l I, 

and  the  probabUity -generating  function  Fj(.z)  of  the  number  of  vessels  destroyed  by  mine  j is 


/’/(2)=(jU  *><)  */«)' 
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since 

is  the  probability  that  some  vessel  is  destroyed  by  mine  j. 

As  a check,  note  that  if  the  quantity  Cy+i,  < in  (11)  is  interpreted  as  the  probability  that  vessel 
i arrives  at  mine  (j’+l),  then  (11)  expresses  this  probability  as  the  corresponding  probability 
minus  the  probability  Xn  given  in  (12),  in  accordance  with  intuition. 

Since  the  vessels  are  described  by  Poisson  streams,  the  probability  generating  function  of  the 
total  number  of  vessels  that  are  destroyed  is  obtained  by 

(13)  F(z)=U  Fj(z), 

i-i 

from  which  the  expected  value  x and  variance  can  be  derived: 

Xjf{l—xsi)—2  xju 

All  vessels  are  given  the  value  “1”  in  the  previous  equations.  However,  the  loss  of  a ship  with 
troops  may  be  more  serious  than  the  loss  of  a sweeper.  This  is  described  by  the  introduction  of  the 
combat  values  a,,  i=l,  . . I of  the  vessels.  The  combat  values  can  be  introduced  in  (14)  and 
(15)  by  replacing  x^  in  (14)  by  atXn,  and  the  terms  x,,(l-i„)  and  X}^^  in  (15)  by  i;,(l-iy<) 
and  a^^  ai,  X|^  x,,,  respectively.  Furthermore,  the  losses  within  a particular  group  of  ships,  indicated 
by  the  subscripts  i as  belonging  to  a subset  G of  jl,  . . .,  I},  can  be  obtained  by  restricting  the 
summations  in  (14)  and  (15)  to  those  i for  which  itO. 

The  number  of  mines  that  are  detonated  can  be  obtained  by  putting  P^i=  1 for  all  l^j^J 
and  l^t^/in  (12)  and  using  the  (13)-(15). 

EXPERIMENTAL  VERinCATION 

The  plausibility  of  the  assumptions  on  which  the  theoretical  model  has  been  founded  is  sup- 
ported by  the  results  of  critical  event  simulations.  The  probability  distribution  {/,}  of  the  total 
losses  of  vessels  was  found  from  the  probability  generating  function  F(z)  given  in  (13)  by  using 
a Fast  Fourier  inversion  procedure.  In  order  to  use  the  computer  program  of  Ref.  [5],  one  deter- 
mines an  integer  m such  that  2*~'<J^2"  and  writes  M=2".  Next,  (13)  is  used  to  obtain  the 
Fourier  transform: 

Cr=^F(Bxp(^-i~y^>wheTer=0,  1,2 M-1, 

from  which  the  probability  distribution  {/, ) is  obtained  by  the  inversion  formula 
y,=  2 C'r  ®*P  n=0,  1,2,  J. 

Note  that/,=0  for  n>J. 


(14) 

(15) 
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Regarding  the  probability  distribution  and  the  expected  value  of  the  losses  of  vessels,  it  was 
found  that  the  results  of  the  simulation  model  oscillated  around  the  corresponding  results  from 
the  analytical  model.  These  experimental  errors  did  decrease  and  the  simulation  results  clustered 
more  tightly  around  the  analytical  model  results  when  the  number  of  replications  of  the  simula- 
tion was  increased.  With  100  replications,  the  sampling  errors  were  greater  than  any  systematic 
error  in  the  probability  distribution  and  the  expected  losses  of  vessels  as  calculated  by  the  analyti- 
cal model;  any  systematic  errors  in  the  expected  values  were  almost  negligible  for  all  the  parameter 
values  considered  (see  Ref.  [3]). 

The  standard  deviations  of  the  losses  of  vessels  as  calculated  by  the  two  models  also  agreed 
closely,  except  when  the  number  of  vessels  was  smaller  than  about  30  (in  this  case,  the  standard 
deviation  obtained  from  the  analytical  model  could  be  up  to  25%  higher  than  that  calculated  by 
the  simulation  model). 

EXTENSIONS 

The  practical  value  of  the  model  can  be  extended  by  taking  explicit  accoimt  of  the  following 
factors: 

(a)  the  intercount  dormant  periods  of  the  mines, 

(b)  the  possibility  of  a sweeper  actuating  a mine  several  times, 

(c)  multiple  crossings  of  the  held  by  sweepers  and  ships, 

(d)  navigation  errors  of  the  vessels, 

(e)  the  reliability  of  the  mines,  and 

(f)  arming  delays  of  the  mines. 

A brief  discussion  of  the  implications  of  these  factors  is  now  given. 

Intereoiint  Dormant  Perioda 

These  make  the  held  less  vulnerable  to  sweeping  and  are  described  by  the  introduction  of 
the  following  parameters: 

tji  The  intercount  dormant  period  of  mine  j;  this  is  the  minimum  time  interval  between 
successive  actuations  of  the  mine. 

Pii  The  distance  between  vessel  i and  the  next  following  vessel,  measured  by  the  time 
between  the  instants  when  these  vessels  cross  a line  perpendicular  to  their  direction 
of  motion,  i=l,  . . .,  I. 

DEFINITION: 

dji.  The  probability  that  vessel  i actuates  mine  j,  given  that  the  vessel  has  not  been  immobilized 
by  any  previously  encoimtered  mine.  (Not  conditional  upon  the  mine  being  active.) 

For  each  mine  j,  the  probabilities  dn  for  i=l,  . . .,  / are  calculated  recursively,  starting  with 
i=  1 and  increasing  i i ^teps  of  1 : 

(16)  dii=^ — dju'^aji,  i=l, 


whore  Kft  is  the  set  of  all  vessel  subscripts  k<Ci  such  that  the  time  distance  between  the  vessels 
k and  % is  smaller  than  the  intercoimt  dormant  period  of  mine  j;  that  is, 
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In  (16),  it  is  assumed  that 


and  ^ • 

S d/<=0  when  =♦. 

kJXu 


In  explanation,  note  that  the  events  that  the  various  vessels  k tKn  actuate  mine  j are  mutually 
disjoint  and  that  each  of  these  events  prevents  actuation  of  mine  j by  vessel  i.  The  term  in  brackets 
in  (16)  gives  the  probability  that  none  of  the  vessels  k€Kn  actuates  mine  j,  which  implies  that  the 
mine  is  active  upon  arrival  of  vessel  i and  is  consequently  actuated  with  probability  a^. 

The  probabilities  d^,  are  now  used  in  the  equations  of  the  analytical  model,  instead  of  the 
Basically,  the  intercount  dormant  periods  are  introduced  by  their  effects  on  the  arrival  rates 
of  vessels,  if  one  uses  (4)  and  (5). 


Multiple  Actuations 

Multiple  actuations  of  mine  j by  sweeper  i can  be  described  by  a parameter  giving  the 
expected  number  of  actuations,  under  the  condition  that  at  least  one  actuation  did  occur.  Thus, 
each  term  (or  dj,C),)  of  (7)-(10)  for  has  to  bo  replaced  by 

Multiple  Crossings 

The  equations  can  be  extended  to  the  case  where  a convoy  or  a collection  of  sweepers  passes 
several  times  to  and  fro  across  the  minefield;  for  example,  when  the  escort  of  a convoy  which 
has  just  come  from  port  through  a narrow  channel  is  required  to  escort  a newly  arrived  group  of 
ships  back  through  the  channel  to  port. 

The  quantities  given  by  (9)  and  (10)  and  used  in  (7)  and  (8)  can  be  interpreted  as  actuation 
potentials  that  are  accumulated  on  the  mines  by  the  vessels.  In  the  first  crossing,  the  initial  value 
of  the  actuation  potential  is  zero,  £^o=0,  and  after  all  of  the  vessels  have  crossed  the  minefield  the 
actuation  potential  of  mine  j is  given  by  Similarly,  the  quantities  that  are  calculated  re- 
cursively by  using  equation  (11)  are  the  probabilities  that  each  vessel  i arrives  at  each  mine  j, 
and  are  the  probabilities  that  each  vessel  i exits  from  the  minefield  without  suffering 

immobilization. 

If  each  vessel  returns  along  the  same  track  that  it  used  in  the  first  crossing,  then  a mine  num- 
bered j in  the  return  crossing  had  the  number  (Z— j-f- 1)  in  the  forward  crossing.  If  we  use  a left-hand 
superscript  beside  a quantity  to  denote  the  number  of  the  crossing,  the  second  crossing  is  described 
by  putting 

with 

/=.Ex->+I.  /• 

The  ’qn  are  calculated  by  (7)  and  (8),  with  replacing  £,i  and  replacing  where 

*^7+1,  jAqn 

*Cn='Ci,^i,  «=Ct+i.  i. 


and 
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However,  these  equations  must  be  modified  if  the  tracks  used  in  the  two  crossings  are  related  by 
some  more  general  transformation  such  as  a parallel  translation  or  rotation.  In  such  cases,  the  mines 
must  be  reordered  according  to  their  projections  on  the  new  tracks,  and  the  probabilities  and 
must  be  recalculated  in  terms  of  the  new  distances  between  vessels  and  mines. 

The  total  losses  of  vessels  in  the  second  crossing  are  calculated  by  using  (12)-(15)  and  letting 
qii  in  (12)  be  replaced  by  Following  crossings  are  treated  in  an  analogous  way. 

Navigation  Errors 

Navigation  errors  are  introduced  in  the  calculation  of  the  probabilities  that  ship  i actuates 
mine  j,  given  that  the  ship  approaches  the  mine  and  that  the  mine  is  active.  Let  Du  be  the  distance 
between  the  planned  track  of  the  vessel  i and  mine  let  a>((x)  be  the  probability  that  the  vessel 
actuates  the  mine,  given  the  distance  z to  its  track;  let  nn(x)  be  the  probability  distribution  of 
navigation  errors  measured  by  the  distance  of  the  vessel  i from  its  planned  track  when  it  passes 
mine  j.  Then 

a,,=J 

Note  that  the  above  approach  slightly  violates  the  principle  that  the  losses  should  be  deter- 
mined for  given  tracks  of  the  vessels  before  taking  spatial  averages.  However,  the  mean  tracks  of 
the  vessels  are  retained  in  the  above  description  such  that  the  dependences  between  the  tracks  of 
the  vessels  are  essentially  retained.  Averaging  with  respect  to  the  mean  tracks  of  vessel  can  then 
be  performed  afterwards. 

Reliability  of  Mines 

The  reliability  of  a mine  can  be  described  by  the  probability  that  it  functions  properly;  these 
probabilities  are  used  to  multiply  the  corresponding  actuation  and  damage  probabilities  an  and 

P,i- 

Arming  Delays 

If  a mine  j is  armed  during  the  transition  of  a convoy  through  the  field,  this  can  be  described 
by  replacing  the  actuation  probabilities  a^  by  the  probabilities  a*t,  where  a*,— 0 until  the  mine 
is  armed;  if  arming  occurs  when  vessel  io  passes  the  mine,  then  a*,=an  for  i>io. 


NUMERICAL  EXAMPLE 


A simple  numerical  example  is  now  given.  The  convoy  consists  of  .30  ships  following  each  other 
in  a single  row,  crossing  a field  of  50  mines  arranged  in  10  lines  of  5 mines  each,  the  lines  being 
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perpendicular  to  the  direction  of  motion  of  the  ships.  The  ships  and  mines  are  numbered  according 
to  the  previously  described  rules.  If  we  consider  the  first  ship  and  the  mines  with  numbers  1,  2,  3, 
4,  and  5 in  the  first  line  of  the  field,  the  actuation  and  immobUization  probabilities  are  as  follows; 

(/„=0.10,  dj,=0.20,  d„=0.40,  (/„=0.20,  d,i=0.10; 

P„=0.05,  P„=0.10,  P, 1=0.20,  P„=0.10,  P„=0.05. 

Each  actuation  and  immobilization  probability  increases  by  0.01  when  the  sequence  number  of 
the  ship  increases  by  1,  such  that 

du=d.(=0.09+0.01.i;  dji=d4.=0.19+0.01.i;  d„=0.39+0.01.i; 
Pn=P»«=0.04+0.01.i;  Pa,=P4,=0.09+0.01.i;  P„=0.19+0.01.i;  (i=l,  . . .,  30). 

The  actuation  and  immobilization  probabilities  within  each  of  the  10  lines  of  mines  are  iden- 
tical. If  we  write  j=5(n-l)-l-»n  for  integers  n and  m where  l<n<10  and  l<m<5,  it  follows  that 

and  Py,=P„,  for  7=1 «/,  i=l,  ...,/. 

The  count  settings  of  the  mines  in  a line  are  all  equal;  it  is  10  in  the  first  line,  9 in  the  second  line 
and  so  on,  decreasing  by  1 from  one  line  to  the  next,  so  that 

my=10— (n— 1). 

The  numerical  values  of  the  probabilities  given  above  reflect  the  fact  that  the  track  of  the 
vessels  passes  directly  above  mines  with  numbers  j=5(n-l)-\-3  for  n=l,  . . .,  10;  the  other 
mines  are  located  some  distance  away  from  the  track.  For  each  ship  and  mine,  the  actuation  and 
immobilization  probabilities  decrease  when  the  distance  between  the  ship  and  the  mine  increases. 

The  probability  distribution  of  the  number  of  ships  that  are  immobilized  and  the  corresponding 
expected  value  and  standard  deviation  are  given  in  the  Tables  1 and  2,  respectively.  The  losses 
are  calculated  by  the  analytical  model  and,  as  a check,  by  a critical  event  simulation  model.  Tables 
1 and  2 designate  as  1,  2,  and  3,  respectively,  those  numerical  results  obtained  by  the  analytical 
model,  the  critical  event  simulation  model  with  100  replications,  and  the  simulation  model  with 
1000  replications. 


TABLE  1.  Tke  Probability  Distribution 
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The  numerical  results  support  the  statements  made  earlier  that  the  analytical  model  provides 
a satisfactory  substitute  for  simulation.  It  is  also  less  demanding  in  computer  time.  While  such 
comments,  based  on  numerical  comparison,  cannot  be  conslusive,  they  do  support  strongly  the 
hypothesis  concerning  plausibility  of  the  approximate  analytical  treatment  of  a model  whose 
details  are  manifold  and  complicated. 
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ABSTRACT 


An  explicit  steady  state  solution  is  determined  for  the  distribution  of  the 
number  of  customers  for  a queueing  system  in  which  Poisson  arrivals  are  bulks  of 
random  sise.  The  number  of  customers  per  bulk  varies  randomly  between  1 and  m, 
m arbitrary,  according  to  a point  multinomial,  and  customer  service  is  exponential. 
Queue  characteristics  are  given. 


1.  INTRODUCTION 

In  numerous  queueing  theoretic  applications,  arrivals  at  a service  facility  are  bulks,  i.e.  sets 
of  possibly  more  than  one  customer,  while  service  is  rendered,  as  usual,  to  individual  customers. 
In  these  situations,  moreover,  it  is  generally  unrealistic  and  misleading  to  treat  the  bulks  as  being 
of  a specified  constant  size.  Rather,  bulk  size  will  be  a random  variable.  Indeed,  in  many  applied 
settings,  the  arriving  entity  may  be  a physical  unit,  but  represents  to  the  server  a randomly  sized 
“bulk”  of  tasks,  e.g.,  airplanes  arrive  at  a maintenance  facility  each  with  a random  number  of 
required  repairs. 

The  problem  of  randomly  sized  bulks  has  been  treated  in  some  generality  in  a fundamental 
paper  by  Oaver  [2].  Also,  in  recent  books  by  Gross  and  Harris  [4]  and  Kleinrock  [8],  the  difference 
equations  for  the  steady  state  probabUity  distribution  of  the  number  of  customers  in  the  system  is 
derived  for  where  refers  to  the  “Markovian”  arrival  of  bulks  of  random  size  X. 

Naturally,  these  diffei  ce  equations  depend  on  the  probability  distribution  of  X.  In  this  paper, 
we  give  an  explicit  representation  of  the  steady  state  probability  distribution  of  the  number  of 


1 
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customers  for  a particular  distribution  of  X which  seems  appropriate  for  many  applications  by 
virtue  of  its  considerable  flexibility.  In  so  doing,  we  generalize  the  results  of  Harris  [5],  a summary 
of  which  appears  in  Ref.  [4],  section  4.1. 

2.  DESCRIPTION  OF  THE  MULTIPLE  POISSON  BULK  ARRIVAL  QUEUEING  SYSTEM 

Assume  that  bulks  of  sizej  arrive  at  a service  facility  according  to  a Poisson  process  {Nf(t) : t>0} 
with  intensity  X^>0,  j=l,  . . .,  m,  and  that  these  m Poisson  processes  are  mutually  independent. 
Note  that  we  are  placing  an  upper  limit  of  m on  bulk  size.  Under  these  assumptions,  the  super- 
posed process  {Af(():t>0}  with  M(t)=Ni(t)+  . . . is  a Poisson  process  with  intensity 


say.  M{t)  represents  the  number  of  bulks  arriving  in  (0,  t]  and  X is  the  mean  rate  of  bulk  arrival. 
The  process  {Ar(t):t>0},  where  iV(t)=iVi(<)+2iVa(0  + . . . +mNmit),  is  a compound  Poisson 
process  in  which  N{t)  is  the  number  of  individual  customers  arriving  in  (.0,  t].  Clearly, 

and,  thus, 

P^' 

is  the  mean  rate  of  customer  arrival.  Because  the  arriving  bulks  are  governed  by  generally  different 
Poisson  processes,  we  shall  refer  to  this  queueing  system  as  the  tnvUiple  Poisson  huUc  arrival  system. 

Note  that  due  to  the  so-called  “preservation  under  random  selection”  property  of  mutually 
independent,  superposed  Poisson  processes,  the  arrivals  in  {M(0:t>0}  are  bulks  of  random  size 
X,  where  X has  a point  multinomial  distribution  with  P{X=j)  — \jl\,  j=l,  . . m (see  Ross  [9] 
p.  123). 

As  previously  mentioned,  service  is  rendered  to  individual  customers  and  not  to  bulks  as  such. 
We  shall  assume  service  to  be  accomplished  by  a single  server  who  operates  according  to  a Poisson 
process  with  intensity  «i.  Thus,  the  time  expended  per  customer  is  an  exponential  random  variable 
with  mean  I/m- 

To  fix  ideas  at  this  point,  we  can  refer  to  the  following  simple  model.  Sheets  of  plywood  coming 
off  a processing  line  are  rejected  if  they  contain  more  than  m defects,  but  are  sent  on  to  a repair 
facility  if  they  contain  between  one  and  m defects.  The  sheets  are  envisaged  to  arrive  at  the  repair 
facility  according  to  a Poisson  process  with  intensity  X,  while  the  probability  that  a sheet  chosen 
at  random  contains^'  defects  is  2,  . . .,  m.  At  the  repair  facility,  a single  repairman  elimi- 

nates defects  individually,  according  to  a Poisson  process  with  intensity  m- 

Now  let  pt  be  the  steady  state  probability  that  the  bulk  arrival  system  contains  k customers 
(k  defects  awaiting  repair  in  the  plywood  example).  Harris  [5]  has  analyzed  the  present  model  and 
specified  {p»}  for  m=2  only.  We  shall  give  an  explicit  expression  for  pt,  k=0, 1,2,  . . .for  arbitrary 
(but  finite)  m. 

The  multiple  Poisson  bulk  arrival  system  model,  along  with  its  explicit  solution,  seems  par- 
ticularly useful  in  and  readily  applicable  to  the  analysis  of  real  situations.  The  model  assigns  m 
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parameters  to  the  arrival  process,  and  consequently,  it  is  reasonable  to  assume  that  it  would 
provide  an  excellent  fit  to  actual  experience  as  opposed,  for  example,  to  the  assumption  of  geometric 
or  Poisson  bulk  size,  each  of  which  adds  only  one  parameter  to  the  arrival  process.  Yet,  the  number 
of  parameters  incorporated  does  not  pose  a difficult  problem  statistically,  for  X can  be  estimated  by 
the  sample  mean  bulk  arrival  rate  and  X^/X  can  be  estimated  by  the  empirical  frequency  of  bulks 
of  size  y,  7=1,  . . .,  m.  Additionally,  we  have  the  advantage  of  the  explicit  expressions  for  the. 
Pt’s,  which  are  more  convenient  from  the  aspect  of  statistical  control  than  recursive  generation. 
Examples  of  the  application  of  this  model  for  7n=2  are  given  in  Harris  [5]  (see  also  Ref.  [4],  Ex.  4.1 
p.  151-164). 

3.  THE  DIFFERENCE  EQUATION  FORM  OF  THE  STEADY  STATE  DISTRIBUTION 

The  birth-death  equations  for  the  multiple  Poisson  bulk  arrival  system  can  be  derived  from  the 
postulates  associated  with  the  various  Poisson  processes  operative  in  the  model.  These  will  yield 
the  steady  state  solution  in  the  form  of  difference  equations  by  the  usual  methods.  This  procedure 
was  actually  employed  in  Jensen  [7].  Recently,  however,  by  a method  based  on  visual  inspection 
of  the  so-called  state-transition-rate  diagram,  Kleinrock  [8]  (see  also  Ref.  [4],  section  4.1)  derived 
the  steady  state  equations  for  the  bulk  arrival  process  with  a general  bulk  size  distribution.  If  X is 
the  random  bulk  size,  then  for  the  case  treated  here 


(1) 

where 


^>»P(A’=7)  = (X,/X)«(»n-7),  7=1,  2,  . . ., 

4(a:)=l  if  Z>0, 

=0  if  z<0. 


Substituting  given  by  (1)  into  Eq.  (4.44)  and  (4.45)  of  Ref.  [8],  p.  135,  we  obtain  directly  the 
steady  state  difference  equations 


(2) 

(3) 

Define 

(4) 


t-i 


(X+M)p»=Mi>»+i+^  P<X*_(j(m— t-f-i),  Ir>l, 
Xpo=MPi- 

i=l »»• 


It  can  be  seen  after  routine  manipulations  that  (2)  and  (3)  can  be  condensed  to  the  recursive  re 
lationship 


(5) 


t—0,  1, 


where  we  have  implicitly  adopted  the  convention  that  p,»0  if  r<0. 

The  probability  generating  function  (p.g.f.)  of  the  steady  state  distribution  {p^}  can  be 
derived  directly  from  (5),  or  as  a special  case  of  (4.44)  of  Ref.  [8],  p.  13. 
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DenotLog  the  p.g.f.  by  «-(2),  we  obtain 

(6) 
and 
(7) 

where 


(8) 


Po=l  — ^ Pi=l  — P. 

m m 

p- jS  ^ Vm- 


Thus,  p is  the  utilization  factor,  i.e.,  the  ratio  of  the  mean  customer  arrival  rate  to  the  mean 
service  rate,  of  the  multiple  Poisson  bulk  arrival  system.  It  follows  immediately  from  Gaver 
[2,  Theorem  9]  that  p<l  is  a sufficient  condition  for  the  existence  of  the  steady  state  solution  and, 
in  particular,  that  po=l— p- 

That  we  shall  be  able  to  give  explicit  expressions  for  the  p^s  is  of  some  mathematical  interest, 
since  these  expressions  represent  the  inverse  of  ^(z),  a function  not  amenable  to  inversion  by 
analytical  methods  for  arbitrary  m. 

4.  THE  EXPUCIT  STEADY  STATE  DISTRIBUTION 

The  p.g.f.  (6)  cannot  be  inverted  nor  can  the  difference  equation  (5)  be  solved  by  algebraic 
methods  for  to>4.  This  problem  turns  out  to  be  a version  of  the  insoluble  problem  of  determining 
the  zeroes  of  at  least  a quintic  equation.  Algebraic  solution  for  2<m<4,  while  conceptually  feasible, 
is  from  a practical  viewpoint  too  cumbersome,  as  seems  evident  from  Harris  [5]  for  to=2.  There 
are  some  indications  that  an  analytic  solution  is  conceptually  possible  through  the  use  of  the 
imbedded  Markov  chain  technique  and  certain  combinatorial  probabilistic  arguments,  but  again, 
for  arbitrary  m,  this  approach  seems  unduly  tedious  (see  Harris  [6]).  Consequently,  we  resort  to 
solution  by  inspection,  which  seems  justified  in  view  of  the  relative  simplicity  of  the  difference 
equation  (5). 

First  we  specify  some  notation.  The  s}mibol  [z]  will  denote  the  greatest  integer  less  than  or 
equal  to  the  real  number  x.  The  multiple  combinatorial  symbol 


(») 

with 


(m  \ ml 

ii,  (if) I 

m=^  it 


will  be  employed  with  the  usual  conventions;  viz,  0!  = 1 and  the  left  hand  side  of  (9)  will  be 
identically  0 if  it<0  for  some  k. 

For  m=2,  i.e.,  bulk  size  limited  to  no  more  than  two,  it  is  not  too  unwieldly  to  use  (5)  to 
recursively  generate  several  terms  of  {pt}-  In  fact,  in  [7],  pt,  k=l,  . . .,  8,  were  generated,  and  by 
examination  of  these  terms,  we  conjectured  that  in  general 


(10) 


(i, ”.-«)• 
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It  is  easily  demonstrated  that  (10)  is  a version  of  the  results  obtained  by  Harris  (6)  so  that 
(10)  is  true  for  m=2.  Proceeding  similarly  for  m=3,  several  successive  terms  of  {p*}  were  generated 
through  (5)  and,  by  inspection,  we  conjectured  that  in  general 


(11) 


n-2t,-i,  \ 

it,  n— 3i|— 2i,/ 


»i>0. 


At  this  point,  it  becomes  apparent  that  these  conjectures  may  be  generalized.  Specifically,  a 
solution  for  arbitrary  m is 


l.(n+ 

(12)  l>«+i=Po  £ 

where  we  define 


“ ^ Vi. i, (»+!)/ 


, n>0. 


(13)  ».(n+l)=(n  + l)-g;t„ 
and 

(14)  It  (n+1)  = [(n+1)  - (^5^  it,)  jkj- 

The  brackets  in  (14)  refer  specifically  to  the  “greatest  integer  in”  notation.  Also,  (12)-(14) 
are  written  in  terms  of  n+ 1 rather  than  n for  later  convenience. 

It  is  easily  seen  that,  subject  to  the  initial  condition  Po=l— p,  p<l  (see  (7)  and  (8)),  any 
sequence  po,  pi,  pa,  . . . satisf3dng  (5)  sums  to  one.  Thus,  to  show  that  (12)  generates  the  explicit 
steady  state  probabilities,  it  sufiSces  to  show  that  the  sequence  p,,  p,,  . . . given  by  (12)  satisfies 
the  difference  equation  (5).  We  now  proceed  to  demonstrate  this. 

We  shall  need  the  combinatorial  identity  (cf.  Ref.  [3],  Ch.  24) 


(15) 


/ n+m  N A/  n+m— 1 

\ii, . . .,  i„,j  \x,, . . .,  it_i,  it  1,  i*+i, . . .,  i*. 


Given  the  conventions  specified  following  (9),  (15)  is  well  defined  when  i,=0  for  some  r. 

First,  note  that  with  n=0  in  (12),  the  initial  condition  pi=pipo  is  met.  Then,  application 
of  (16)  to  (12)  gives 

HI 

P»+i=^  P*I?»i 

where 


(16) 


ii  («+!)  — 1+^  t, 

. . .,  x*-l, . . .,  Xi(n+1), 


k 1,  . . .,  m and  ii“ii(n+l).  Thus,  we  must  show  that  if»=p«+i_*,  where  p,  is  defined  by  (12). 

First  consider  R,,  i.e.,  in  (16)  take  i=l.  Suppose  that  for  some  r>l,  i,>/,(n) ; in  particular, 
suppose  t,=/,(n)  + l.  Then 


ii{n+l)—l=n—'^jij=n—'^jit—r(I,{n)-\-l)=r 

h‘r 
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For  any  x,  x—[x]  is  the  fractional  part  of  x and  is  thus  strictly  less  than  1;  clearly,  then 
ii(n+l)  — 1 is  negative.  This  causes  the  combinatoric  in  (16)  to  be  0 and,  hence,  we  may  replace 
all  the  upper  limits  therein  by  /(n).  Also  in  (16)  with  it=l,  we  can  replace  {i(n+l)  — 1 by  ii(n), 
since  i|(n+l)  — l=ii(n).  After  making  these  replacements  in  (16),  comparison  with  (12)  indicates 
that  Ri—pn. 

Now  consider  Rt  for  any  it>l.  If  it=0,  the  combinatoric  in  (16)  is  clearly  0,  so  we  may  begin 
the  kih  summation  with  tt=l.  Then,  after  changing  the  index  u to  i'^,  where  1,  and  identi- 

fying i/=iy,  i^l,  k,  we  see  that  in  terms  of  the  new  indices 

nS)  ii(n-f-l)=n-t-l-gii,-i(t;-|-l) 

iftit 

= (n-iH-l)-jU  ji',=ii(n-k+l) 

and 

^ t/— l=n— i-f-1— i't 


Thus,  (16)  becomes  for  t>l 
(19) 


=n-i-i-i-ju  (i-i)t;=jg  t;. 

R.-p.-’S”  ■ ■ ■ “IS-  ■ ■ ■ T (5  '‘X 


Comparison  of  (19)  with  (12)  and  (18)  with  (13)  indicates  that  if  we  can  replace  the  upper 

summation  limits  in  (19)  with  /,(n— i-|-l),  r=2 m,  then  i?»=p,_»+i  as  we  are  attempting 

to  prove.  To  this  end,  suppose  first  that  r<Jk-  Then 


(20) 


Now  suppose  r>i  and  that  i',>/,(n— t-j-l)  and  in  particular  that  i'=/,(n— ifc-t-l)-i-l.  In 
this  case,  by  (18). 

(21)  i,'  (n-t-l-l)=n-jfc-|-l-g  ii;-r(/,(n-ifc-|-l)-|-l)<0 

by  the  same  steps  as  in,  and  the  argument  following,  (19).  Thus,  for  r'^k,  the  combinatoric  in  (19) 
is  0 for  t^>/,(n— it-f-1).  Finally,  it  is  easy  to  see  that 

/»(n-|-l)-l=/»(n-t-|-l). 


(22) 
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By  (20)-(22),  each  of  the  upper  limits  in  (19)  can  be  changed  without  effect  to  /,(n— Jfc+1), 
r=2,  . . .,n». 

The  proof  is  essentially  complete.  We  need  only  verify  that  for  iir>n+l,  Rt=0  in  keeping 
with  our  convention  that  P,+i_t=0  for  i>n+l.  Now,  ii;>n+l  implies  )i:>l  so  that  (19)  is  ap- 
propriate. By  (21),  i'i{n—k+l)<n—k-{-l<iO  for  ky-n+l,  which  implies  P,=0. 

5.  QUEUE  CHARACTERISTICS 

For  completeness  and  convenience  we  provide  here  some  of  the  queue  characteristics  for  our 
particular  case  of  the  system  (which  we  obtain  from  known  results  concerning  other 

queueing  systems). 

Let  N be  the  number  of  customers  in  the  multiple  Poisson  bulk  system  with  respect  to  the 
steady  state  distribution.  Then  E(N)  and  Var(N)  can  be  obtained  from  (5)  or  by  the  differentiation 
of  ^(z),  the  p.g.f.  of  N (cf.  Feller  [1]).  Specifically, 

(23)  jhlPo 
and 

(24)  Var  (iV)=(g  j^P>IPo)+(±  jp,lp^’ 

where  po  can  be  expressed  in  terms  of  the  system  parameters  as  in  (7),  and  is  itself  an  important 
queue  characteristic. 

Other  characteristics  of  importance  in  any  bulk  arrival  process  are  waiting  times  for  groups 
and  waiting  times  for  individual  customers.  When  a bulk  is  a physical  unit,  as  in  the  plywood 
example,  waiting  times  for  individual  customers  may  not  be  of  practical  importance,  for  a customer 
cannot,  or  does  not,  leave  the  system  until  the  bulk  of  which  he  is  a member  leaves.  In  many  situa- 
tions, however,  customer  waiting  times  will  have  some  significance.  Thus,  when  families  eat  in 
restaurants,  the  time  which  the  entire  family  waits  to  be  served  is  the  significant  factor,  but  when 
long  distance  buses  stop  for  meals,  individual  pa.s.sengers  are  generally  more  concerned  with  their 
own,  rather  than  the  group’s,  waiting  time. 

Let  W,  be  the  wait  on  queue  for  a bulk  when  the  system  is  in  equilibrium  condition.  That  is 
W,  is  the  elaspsed  time  from  the  arrival  of  the  bulk  to  the  start  of  service  to  its  first  member.  We 
can  calculate  E{W^  for  the  present  model  from  e.g.  (7.17)  of  Guver  [2]  or  by  suitably  modifying, 
well-known  results  from  M/G/1.  We  obtain. 

(25)  £(W.)  = p12m(1-p)]-'  f iX,)- 

Now  let  T,  be  the  total  time  a bulk  spends  in  the  system  and  let  X refer,  as  before,  to  bulk  size. 
Then  for  a bulk  of  specified  size,  e.g.  J,  the  expected  total  waiting  time  is  E(T,\X=J)  = E{,W^-\-Jlit 
and,  in  general,  £(T,)=fe’(W’,)-l-p/X. 

Let  T,  denote  the  time  an  individual  customer  sjiends  on  queue  and  let  Aj  be  the  event  that 
a customer  is  the  jth  served  within  a bulk.  As.sume  that  .service  within  a bulk  is  random,  i.e., 
P{,Af\X=J)  = \IJ,  \ <i<J  and  0 otherwise.  Then,  if  bulk  size  is  specified,  e.g.  J,  the  expected 
time  on  queue  for  a member  of  that  bulk  is 
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(26)  E{T,\X=J)=^  E{T^A„  X=J)P{A,\X=J) 

=E{W,)+±  [(i-l)/M](l/J) 

For  the  general  case  in  which  bulk  size  is  not  specified,  the  expected  customer  time  on  queue 
is  E(Tc),  where 

E{T,\X=J)P{X=J) 

j-i 

=Eav,)+(i/2(p/x-i/M)). 

Expected  total  time  in  system  rather  than  on  queue  can  be  obtained  from  (26)  and  (27)  by 
adding  1/m  to  EiTc)  or  E(Tc\X=J),  whichever  is  appropriate. 

It  is  of  some  interest  to  compare  the  multiple  Poisson  bulk  arrival  system  with  the  simple, 
i.e.,  the  M/M/1,  queueing  system,  where  we  assume  that  the  customer  mean  arrival  and  service 
rates  are  the  same  for  both  systems.  Let  N,{N)  denote  the  number  of  customers  in  the  MIMjl 
(multiple  Poisson  bulk  arrival)  system.  That  E{N)-E(N,)^0  for  the  general  bulk 

arrival  system  was  pointed  out  by  Gaver  [2].  In  particular,  for  the  present  model,  by  (23)  and 
well-known  results  for  M/M/1, 

m 

e{N)-e{n.)=j:  ju-Dh/m-P)^] 

or  equivalently 

E(N)/E{N.)=-^  j{j+l)\jl(2  g ix,)- 

In  fact,  in  our  computer  simulations  of  the  multiple  Poisson  bulk  arrival  system,  we  always 
observed  that  N is  stochastically  greater  than  N„  i.e.,  P(iV>n)  >P(iV,>n),  for  all  n (cf.  Ref. 
[7],  Tables  III-IV).  A final  comparison  with  M/M/1  is  that  while  P/N,=n)  decreases  monotonically 
in  n,  P{N—n)  does  not  necessarily  (cf.  Ref.  [7],  Fig.  7). 
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ABSTRACT 

This  paper  presents  a method  for  modeling  cyclic  inputs  to  a congested  system 
in  a discrete  event  digital  simulation.  Specifically,  we  express  the  mean  of  the 
interarrival  time  conditional  on  the  last  arrival  as  a linear  combination  of  harmonic 
components  whose  coefficients  can  be  determined  by  stepwise  regression.  We  also 
assume  that  the  conditional  interarrival  time  normalized  by  its  corresponding  mean 
follows  a distribution  that  is  independent  of  time.  The  result  can,  in  turn,  be  used 
to  generate  the  desired  input  for  a simulation.  An  example  based  on  a set  of  actual 
data  is  used  to  illustrate  the  process  of  parameter  estimation  for  the  model. 


1.  INTRODUCTION 


Although  statistical  methods  play  a central  role  in  the  digital  simulation  of  queueing  systems, 
one  area  in  which  these  methods  have  been  notably  under-utilized  is  the  construction  of  arrival 
generators  for  systems  whose  arrival  frequencies  vary  with  time.  For  example,  it  is  not  unusual 
to  find  arrival  patterns  that  vary  with  time  of  day  and  day  of  week.  Failure  to  acknowledge  these 
patterns,  when  they  exist,  can  lead  to  serious  distortions  in  system  behavior.  In  particular,  treating 
interamval  times  as  independent  and  identically  distributed  when  they,  in  fact,  exhibit  a cyclic 
pattern  removes  the  clustering  of  arrivals  associated  with  successively  shorter  interarrival  times. 
Since  this  clustering  is  a principal  source  of  congestion,  the  effect  of  overlooking  dependence  is  to 
distort  the  congestion  pattern. 

To  put  the  problem  in  perspective,  we  shall  first  describe  the  principal  issues  that  arise  in 
attempts  to  characterize  time-dependent  arrivals  by  empirical  distributions.  For  expository  pur- 
poses, we  consider  an  arrival  pattern  that  depends  on  the  day  of  week  but,  within  day  interarrival 

times,  is  independent  and  uniformly  distributed.  Let  7;  A;=  1, . . .,  if}  be  a sample 

record  of  the  number  of  daily  arrivals  over  K weeks.  Then 
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Fj{i)=K-^  g 1=0,  1, . . 00, 

I(Xn,  »)  (t)  = 

denotes  the  sample  cumulative  distribution  function  for  the  number  of  arrivals  on  day  j of  each 
week.  Suppose  that  we  want  to  generate  an  arrival  on  day  j' . Let  U denote  a uniform  deviate  on 
(0,1).  Then  the  number  of  arrivals  on  day  j'  is  /j.=inf  [i'.U<Ff{i)  \ j=j'  (mod  7)],  and  the  Ar"" 
arrival  on  that  day  occurs  at  j'+F*,  where  {F*;  ^=1,  . . .,  /,.}  is  a sequence  of  independent  uni- 
form deviates.  The  extension  of  the  above  approach  to  include  other  cyclic  variations  is  straight- 
forward (e.g.,  see  Ref.  [4],  p.  6).  The  appeal  of  this  approach  lies  in  its  simplicity  and  complete 
reliance  on  untransformed  uniform  deviates.  Although  alternative  methods  exist  for  generating 
arrival  times  from  empirical  distributions,  the  above  procedure  is  common  enough  so  that  the 
statistical  and  computational  issues  it  raises  merit  serious  attention. 

Statistical  Issues. 

Since  empirical  distributions  only  represent  realizations  of  random  phenomena,  discontinuities 
in  frequency  distributions  and  presence  or  absence  of  extreme  values  are  to  be  expected.  This 
important  drawback  is  intrinsic  to  the  use  of  statistical  data  directly  as  input.  A large  sample 
could  only  partially  rectify  the  situation.  Also,  splitting  a complete  cycle  into  segments  with 
homogeneous  arrival  rates  could  cause  abrupt  behavioral  changes  between  adjacent  segments. 

Computational  Issues 

Since  the  arrival  times  generated  in  the  prescribed  manner  are  unordered  within  a day,  the 
simulation  must  devote  time  to  ordering  them  in  the  process  of  scheduling  each  arrival.  This  leads 
to  multiple  arrival  notices  in  the  list  of  scheduled  events,  a situation  that  reduces  the  computa- 
tional efficiency  of  the  simulation.  An  alternative  approach  is  to  generate  all  arrivals  in  a pre- 
processing program  that  orders  them  chronologically.  However,  the  resulting  saving  in  computa- 
tional efforts  is  not  at  all  clear. 

In  the  next  section,  we  shall  introduce  a procedure  that  uses  interarrival  times  to  generate 
time-dependent  arrivals.  Although  the  procedure  does  not  totally  resolve  these  issues,  they  at  least 
reduce  their  saliency,  especially  the  statistical  aspects  of  the  problem.  Finally,  we  shall  present 
an  example  based  on  actual  data  to  illustrate  the  process  of  parameter  estimation  for  the  model. 

2.  THE  MODEL 
Background 

In  this  section,  we  describe  a procedure  that  uses  interarrival  time  data  to  construct  an  arrival 
generator  for  queuing  simulations.  Lot  T,  denote  the  arrival  time  of  the  t*"  arrival,  and 
Si=Tt  — Ti-i  denote  the  interarrival  time  between  the  (i  — 1)“  and  z***  arrivals.  Suppose  that  we 
represent  the  1“  interarrival  time  S,  conditional  on  Tt^x  as 


0,  i<X,, 

, L 


(la) 
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(lb)  X(7’i-x)  = •E(S'<ir,_,)  = ao+^  (o,  cosfl^rj_j  + 6,  sin  9,r,_i), 

■where  we  assume  that  \{T{-i)'^0  and  |*()r<_i}  is  a sequence  of  independent  random  variables 
with  zero  mean  and  domain  on  the  half  open  interval  [ — X(r,_)),  «).*  In  using  the  harmonic 
function  to  represent  the  mean  conditional  interarrival  time,  the  choice  of  the  {9j|  determines  a 
set  of  frequencies  that  account  for  cyclic  behavior  in  an  arrival  pattern.  As  an  example,  if  we 
want  to  take  the  within-week  cyclic  variations  into  consideration  (e.g.,  the  variations  due  to 
hour  of  the  day  and  day  of  the  week),  and  times  are  measured  in  hours,  then  we  would  use  9,=t/168 
and  8j=j8i  for  j=2,  . . .,  168.  The  {a^\  and  {6^}  in  (lb)  remain  to  bo  estimated  from  the  data. 

To  facilitate  the  generation  of  the  St  during  a simulation,  we  define  a normalized  interanival 
time  Rt  conditional  on  Tt^i  as 

(2)  Rt=St/HTt-i)=l+tt/\(Tt.t). 

We  assume  that  |ii,|r,_,}  is  a sequence  of  independent,  identically  distributed  nonnegative 
random  variables.*  We  shall  have  more  to  say  about  Rt  in  the  sequel. 

Parameter  Estimation 

The  estimation  of  the  {a^|  and  (6^1  in  (lb)  can  be  done  through  the  use  of  some  form  of  re- 
gression analysis,  provided  that  (a)  X(7’,_,)>0,  and  «,>-X(r,_,).  Imposing  the  two  constraints 
on  a regression  analysis  creates  difiiculties  with  regard  to  the  sampling  properties  of  the  method. 
However,  for  all  practical  purposes  we  propose  to  circumvent  the  complication  by  performing 
an  unconstrained  stepwise  regression  to  determine  which  of  the  2n-i-l  coefficients  contribute 
significantly  to  \{Tt-i)  and  then  examining  the  results  to  determine  whether  the  constraints  have 
been  violated. 

Before  we  postulate  an  underlying  distribution  for  the  normalized  interanival  time  Rt  con- 
ditional on  Tt-t,  we  shall  first  look  at  its  sample  mean  E{k,\Tt-i).  Let  \{Tt-t)  be  the  estimate  of 
X(T,_i)  from  the  stepwise  regression  and  consider  the  sequence  where 

_ St  

XiTt.t)  X(2’._,)-|-[x(7’,-,)-X(r._,)] 

St/XjTt-i) 

i-{[x(r,_,)-x(7’._,)]/x(r._,)} 

= (s./x(r..,))  [i-i-jEj  {[x(r._,)-x(T,_,)]/x(r,-,)y]- 

Taking  expectations  of  the  above  expression,  we  obtain 

(3)  1 + {lf\iTt.t))E(St  g {[x( Tt.t)  - X(r,-,)]/x( T,.,)}') ■ 

*Theae  awtimptiona  eaa  easily  be  verified  in  a regression  analysis.  { denotes  a sequence  of  random  vari- 
ables conditional  on  X. 
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If  the  sample  size  is  large,  we  would  expect  the  deviations  X(r,_i)— X(7',_i)  to  be  small  relative  to 
Consequently,  we  choose  to  ignore  the  rightmost  term  of  (3).  In  practice,  a quick  way  to 
check  the  relative  importance  of  the  term  is  to  compute  iBt/N  (N  is  the  sample  size)  and  check  if 
it  close  to  one.  Henceforth,  our  model  specification  implies  that  Ri  is  positive  with  mean  unity  and 
identically  distributed  for  all  i. 

To  make  a distributional  assumption  for  Rt,  at  least  three  distributions  deserve  consideration. 
They  are  the  gamma,  lognormal,  and  Weibull  distributions.  Imposing  the  constraint  that  Rt  has  a 
mean  unity  reduces  each  of  these  to  a one-parameter  distribution.  The  selection  of  a proper  under- 
lying distribution  among  the  three  requires  a consideration  of  the  "goodness-of-fit”  of  the  data 
under  the  postulated  distributions  as  well  as  the  relative  computational  eflSciency  in  generating 
random  variables  under  each  choice.  A detailed  discus.sion  of  the  issues  involved  can  be  found  in 
Ref.  [2],  pp.  208-213  and  Ref.  [4],  pp.  10-12.  In  the  example  to  be  given  in  the  next  section,  we 
find  that  the  use  of  the  Weibull  distribution  is  adequate  as  far  as  the  prescribed  issues  are  concerned. 

For  the  Weibull  case,  Rt  has  the  following  probability  density  function: 


(4) 


/a<W  = 


a[r  (l/a-h  l)]“r“->e-  >’•  r <■/“+■)!  “, 

0, 


0<r<  00 
elsewhere. 


The  maximum  likelihood  estimation  of  a is  the  solution  to  the  equation 

iV/a-fATln  r(l/a-|-l)-iV'l'(l/a-M)/a-|-2  In  R<-[r(l/S-|-l)r2i?  “ In  Rt 

-lr(l/a-|-l)l”  [In  r(l/a4-l)-'P(l/S+l)/a]2ff,“=0, 

where  i('(  ) is  the  digamma  function.  When  2f?,/A^is  veiy  close  to  unity,  the  unconstrained  maximum 
likelihood  procedure  described  in  Ref.  [2]  (i.e.,  a procedure  without  presuming  that  Rt  has  an  unit 
mean)  provides  a convenient  alternative  for  estimating  a. 


3.  AN  EXAMPLE 

To  illustrate  the  use  of  the  procedure  presented  in  the  last  section,  we  use  a set  of  sequences  of 
successive  patient  arrivals  to  a coronary  care  unit  in  a hospital  in  New  Haven,  Connecticut.*  The  sample 
record  covers  a two-year  i>eriod  during  which  arrivals  not  admitted  to  the  unit  were  not  recorded. 
However,  we  were  able  to  identify  the  sequences  that  corresponded  to  the  time  segments  in  which 
successive  arrivals  were  all  admissions.  Based  on  these  data,  an  unconstrained  stepwise  multiple 
regression  with  n=168  yields  an  F value  of  5.257  with  18  and  223  degrees  of  freedom,  which  is 
significant  at  the  99.9  percent  level.  The  fact  that  f?*=0.236  (i.e.,  the  proportion  of  variations 
attributable  to  the  regression  equation)  should  by  no  means  be  construed  as  less  than  satisfactory. 
A priori,  we  know  that  a large  component  of  random  variation  exists  in  the  data.  Fitting  the  har- 
monic terms  removes  the  time-dependent  share  of  variations.  To  check  for  correlation  between 
successive  normalized  interarrival  times,  we  analyzed  the  pairs  (f?,,  i?(+i)  obtained  from  all  sequen- 
ces containing  at  least  three  successive  arrivals  and  found  no  evidences  to  reject  their  stochastic 
independence. 


*The  significance  of  fitting  these  data  to  a time  series  model  for  simulation  purposes  is  described  in  Ref.  [3]. 
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Having  found  S/?</N=0.9984,  we  decided  to  use  the  one-parameter  Weibull  given  in  (4) 
as  the  underlying  distribution  for  R^  to  account  for  time-independent  stochastic  behavior.  The  un- 
constrained maximum  likelihood  procedure  of  Ref.  [1]  gave  1.413.  To  evaluate  the  “goodness- 
of-fit”,  we  compared  the  sample  cumulative  distribution  of  the  with  the  estimated  cumulative 
Weibull  distribution  1— e”'"  and  foimd  that  the  fit  was  unusually  close  in  the  tails  but  not  as  close 
in  the  central  range.  For  details,  see  Ref.  [4]. 

As  mentioned  earlier,  the  regression  analysis  used  was  unconstrained.  A careful  check  of  \{T) 
for  O^T^lfiS  revealed  that  one  negative  value  did  indeed  occur.  Although  negative  values  seem 
inconsistent  with  a model  whose  parameter  estimation  relies  on  positive  data,  we  offer  the  following 
rationalization.  In  fitting  a harmonic  function,  a least-squares  fit  may  produce  negative  values 
when  data  contain  rapid  downward  shifts  in  mean  values  over  relatively  short  intervals.  That  is, 
the  minimizing  unconstrained  curve  must  be  regarded  as  an  interpolation  between  sample  data 
points,  and  it  unfortunately  can  produce  negative  values  when  interpolating  between  the  last  point 
in  a rapidly  decreasing  sequence  of  interarrival  times  and  the  first  point  in  a rapidly  increasing 
sequence  of  interarrival  times. 

One  way  to  handle  the  problem  of  negative  values  is  to  proceed  with  the  use  of  the  estimated 
generate  all  interarrival  times  prior  to  the  simulation,  and  then  order  them  chronologically. 
Although  this  approach  is  computational  time  consuming,  it  solves  the  problem  and  overcomes 
most  of  the  inadequacies  of  the  empirical  approach  mentioned  in  Section  1.  An  alternative  ap- 
proach that  guarantees  X(7’)>0  is  suggested  in  Ref.  [4].  The  procedure  involves  working  with  the 
logarithm  of  St  in  (lo).  Unfortunately,  when  it  was  applied  to  the  same  set  of  data,  we  found  that 
the  fit  of  the  underlying  distribution  for  R,  was  considerably  poorer  than  that  of  the  first  approach 
(Ref.  [4],  pp.  20-22). 


4.  GENERATION  OF  INTERARRIVAL  TIMES 


In  generating  interarrival  times  Si  using  the  aforementioned  procedure,  we  first  recall  that 
Si=Ri\{Ti-i).  If  the  last  arrival  occurred  at  time  Ti-i  and  the  normalized  conditional  inter- 
arrival time  R,  follows  the  Weibull  distribution,  then  the  interarrival  time  for  the  next  arrival  S, 
is  given  by 


Si 


r(l/a-l-l) 


(-In  C/)*'*, 


where  17  is  to  be  generated  from  the  uniform  distribution  n(0,  1)  (e.g.,  see  Ref.  (21,  p.  211).  To 
save  computation  time,  we  can  table  the  values  of  X(0  for  suitably  discretized  values  of  1 in  a 
compV'*  -ycle  (e.g.,  a week)  before  carrying  out  the  simulation  runs. 
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ABSTRACT 


Suppoae  that  we  have  enough  computer  time  to  make  n observations  ot  a 
stochastic  process  by  means  of  simulation  and  would  like  to  construct  a confidence 
interval  for  the  steady-state  mean.  We  can  make  k independent  runs  of  m observa- 
tions each  (n=i'm)  or,  alternatively,  one  run  of  n observations  which  we  then 
divide  into  k batches  of  length  m.  These  methods  are  known  as  replication  and 
batch  means,  respectively.  In  this  paper,  using  the  probability  of  coverage  and  the 
half  length  of  a confidence  interval  as  criteria  for  comparison,  we  empirically  show 
that  batch  means  is  superior  to  replication,  but  that  neither  method  works  well  if 
n is  too  small.  We  also  show  that  if  m is  chosen  too  small  for  replication,  then  the 
coverage  may  decrease  dramatically  as  the  total  sample  size  n is  increased. 


1.  INTRODUCTION 

Let  {Xi,  t^l}  be  a stochastic  process  for  which  we  would  like  to  estimate  the  steady-etate 
mean  n: 

n 

M=lim  XJn  (with  probability  1) 

=lim  XJn. 

(These  limits  exist  and  are  equal  for  all  processes  considered  in  this  paper;  see  Ross  [12],  p.  98.) 
Given  enough  computer  time  to  make  n observations  of  the  process  by  means  of  simulation,  how 
should  one  construct  a confidence  interval  (c.i.)  for  n?  The  difficulty  is  that  for  most  simulations, 
the  observed  process  is  nonstationary  and  autocorrelated.  Thus,  the  techniques  of  classical  statisti- 
cal anal3rsi8  for  independent  identically  distributed  (i.i.d.)  observations  are  not  directly  applicable. 

Five  methods  have  been  suggested  in  the  simulation  literature  for  solving  the  above  problem ; 
replication,  batch  means,  spectrum  analysis,  autoregressive  representation,  and  regeneration 

•This  research  was  supported  by  the  Office  of  Naval  Research  under  contract  N000l4r-76-C-O403  (NR 
047-146). 
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cycles.  (See  Oane  and  Iglehart  [3,  4],  Fishman  [6],  and  Iglehart  [10]  for  descriptions  of  these  meth- 
ods.) However,  there  has  been  no  definitive  effort  made  to  determine  which  of  the  methods  is  the 
best  for  a given  simulation  situation.  Thus,  a simulator  who  is  actually  interested  in  estimating  a 
steady-state  mean  may  not  know  which  method  to  employ. 

In  this  paper,  which  is  the  first  in  a series  on  confidence  intervals  in  simulation,  we  analyze 
and  compare  the  widely  used  methods  of  replication  and  batch  means.  The  methods  are  similar 
in  philosophy  in  that  both  try  to  avoid  autocorrelation  by  breaking  the  data  into  “independent” 
segments.  The  sample  mean  of  the  data  in  each  segment  is  computed  and  the  analysis  for  i.i.d.  ob- 
servations is  applied  to  the  sample  means  to  construct  a c.i.  for  the  steady-state  mean.  Using 
coverage  and  half  length  as  criteria  for  comparison,  we  conclude  from  simulations  of  several  queueing 
and  inventory  systems  that  batch  means  is  superior  to  replication,  but  that  neither  method  works 
well  if  the  total  sample  size  n is  too  small. 

The  remainder  of  this  paper  is  organized  as  follows.  Section  2 describes  the  two  methods  in 
detail.  The  methods  are  empirically  compared  in  Section  3.  Section  4 explains  why  the  empirical 
results  differ  from  theory  and  Section  5 offers  suggestions  as  to  how  the  methods  may  be  improved. 


2.  DESCRIPTION  OF  THE  TWO  METHODS 
A.  RepUeation 

Suppose  we  make  k independent  simulation  runs,  each  of  length  m observations  {n=k  Tn). 
We  accomplish  the  independence  of  runs  by  starting  each  run  from  scratch  (for  queueing  simula- 
tions this  usually  means  that  no  customers  are  present  at  time  zero)  and  by  using  a different  stream 
of  random  numbers  for  each  run.  Let  Xt{m)  (i=  1,  2 it)  be  the  sample  mean  of  the  m observa- 

tions in  the  ith  run.  The  7|(m)’s  are  i.i.d.  random  variables  (r.v.’s)  since  the  runs  themselves  are. 
We  use 

m)=g  X,ifn)/k 


as  our  point  estimator  of  the  steady-state  mean  ii. 

Let  /»(wi)=^[^f(wi)].  Then  by  the  definition  of  is,  ii{m)—*is  as  m— ♦«>.  Furthermore,  if  ji(m)=M 
and  0<v*[^((m)]<  00,  then  by  the  classical  central  limit  theorem  we  have 


I 

(1) 


N{0,  1)  as  ifc-«., 
m)J 


where  N{0,  1)  is  a mean  0,  variance  1,  normal  r.v.,  and  —*  denotes  convergence  in  distribution. 
Furthermore,  (1)  remains  true  if  v*[jlL(it,  m)]  is  replaced  by 

(2)  (k,  »n)]=g  [Z(m)-X(k,  m)]V*(*-l). 

If  the  X«(m)’s  are  normally  distributed,  then  the  ratio 


[3?(*,  m)-is  ]/VJ-[T(i, »)] 


r 
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has  the  t distribution  with  i — 1 degrees  of  freedom  (d.f.)i  and  an  exact  100(1— a) % c.i.  for  n is 
given  by 

(3)  f (i,  m)], 

t __ 

L where  <*_|.  ,_«/j  is  the  1— a/2  point  for  a t distribution  with  i — 1 d.f.  Even  if  the  Xi(ni)’s  are  not 

normally  distributed,  it  is  common  practice  when  k is  small  to  use  (3)  to  construct  a c.i.  for  ii. 
r There  are  two  potential  sources  of  error  when  using  replication  to  construct  a c.i.  for  a steady- 

state  mean:  the  fact  that  n(m)^n  and  the  nonnormality  of  the  Xi(m)’s.  These  errors  will  be 
discussed  in  Section  4. 

B.  Batch  Means 

One  disadvantage  of  replication  is  that  E\^X(k,  m)]  = /t(m)  for  any  k,  so  that  for  fixed  m, 

X(k,m)  is  a biased  estimator  of  /i  no  matter  how  many  replications  are  made.  Since  as 

m—*  00 , suppose,  as  an  alternative  to  replication,  we  now  make  one  long  nm  of  length  n and  then 
divide  the  resulting  observations  Xi,  X2,  . . X^  into  k batches  each  of  length  m.  Let  Xt(m) 

(i=l,  2,  . . k)  he  the  sample  mean  of  the  m observations  in  the  ith  batch.  Once  again  we  use 
X(k,  tn)  as  our  point  estimator  of  >1. 

If  we  choose  m sufBciently  l^ge,  then  the  .X'i(m)’s  will  be  essentially  uncorrelated  (see  Tables  5 
and  7),  and  we  can  estimate  <r’[.X'(A:,  m)]  by  (2).  If  m is  large  enough  so  that  the  Xi(m)'s  are  approxi- 
mately normally  distributed  in  addition  to  being  uncorrelated,  then  the  X,(m)’s  are  also  “inde- 
pendent” (see  Ref.  (6),  p.  142).  We  thus  have  essentially  the  same  situation  as  for  replication 
and  (3)  may  be  used  to  construct  a c.i.  for  /i. 

There  are  three  potential  sources  of  error  when  using  batch  means  to  construct  a c.i.  for  a 
steady-state  mean;  the  correlation  between  the  X,(m)’s,  the  fact  that  the  Xj(m)’s  are  not  identically  | 

distributed  with  mean  m,  and  the  nonnormality  of  the  .?,(m)’s.  I 

For  expository  convenience,  we  will  henceforth  use  the  phrase  “point  estimator  bias”  to  mean  < 

for  replication,  and  to  mean  the  .Y,(7n)’s  are  not  identically  distributed  with  mean  n for  j 

batch  means.  1 

1 

1 

3.  EMPIRICAL  COMPARISON  i 

In  order  to  compare  replication  and  batch  means,  we  simulated  several  well-known  queueing  | 

and  inventory  systems  for  which  analytical  results  are  available.  The  results  of  these  simulations  ] 

are  presented  in  this  section.  ] 

The  random  numbers  { f/*,  i > 1 } used  in  this  paper  were  generated  from  the  following  generator  j 

which  is  available  on  our  version  of  the  Univac  1110:  ] 

r,=  (5'‘r._.-M)  mod  2«  (t=l,  2,  . . .)  j 

f;.=r,/2«(i=i,2 ),  I 


J 


where  Tt  is  a given  seed.  For  a discussion  of  this  generator,  see  Coveyou  and  Macpherson  [2]. 
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The  first  stochastic  system  we  considered  was  the  MjMjl  queue.  We  let  E(A)  = \ (the  mean 
interarrival  time),  £(iS)=0.9  (the  mean  service  time),  p=E{S)IE{A)=0.9  (the  trafiSc  intensity), 
and  then  simulated  the  stochastic  process  {D,,  i },  where  Df  is  the  delay  in  queue  (not  including 
service  time)  of  the  ith  customer  and,  for  our  study,  Di=0  (i.e.,  no  customers  are  present  at  time 
zero).  Our  objective  was  to  construct  90%  c.i.’s  for  the  steady-state  mean  delay  in  queue,  d=8.1. 
We  performed  400  independent  simulation  experiments;  for  each  experiment,  we  considered  n= 
1600,  3200,  6400,  12800  and  k=5,  10,  20,  40;  for  each  n and  k we  constructed  a c.i.  for  d using  both 
replication  and  batch  means.  However,  the  longer  nms  were  continuations  of  the  shorter  runs,  so 
results  for  different  values  of  n are  not  independent.  Furthermore,  for  a given  value  of  n,  exactly 
the  same  stream  of  random  numbers  was  used  to  compute  the  delays  for  the  two  methods. 

In  Figure  1 we  plot,  for  each  n and  k and  each  method,  the  average  half  length  of  the  400 
c.i.’s  vs  the  proportion  of  the  400  c.i.’s  which  covered  d.  The  same  proportions  of  coverage  are  also 
given  in  Table  1.  By  way  of  example,  for  replication  with  n=1600  and  k=5  (i.e.,  5 runs  of  length 
320),  211  out  of  the  400  (or  52.75%)  90%  c.i.’s  contained  d and  the  average  half  length  was  3.11. 
From  Figure  1 we  can  see  that  for  a given  value  of  n,  the  curve  for  replication  lies  above  that  for 
batch  means.  This  implies  that  for  a specified  half  length,  batch  means  has  greater  coverage  than 
replication  or,  alternatively,  for  a specified  coverage,  batch  means  has  a shorter  half  length  than 
replication. 


half 

Itngth 


proportion 
of  coverige 


FIGURE  1.  Average  half  length  vs  proportion  of  coverage  for  various  90%  confidence  intervals  for  d=S.l  in 

an  M/M/l  queue  with  p— 0.9. 


< 
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TABLE  1.  Proportion  of  Coverage  of  Various  90%  Confidence  Intervals  for  d=8.1  in  an  MjMjl 

Queue  with  p=0.9. 


Batch  Means 


0?.  527SJ 
03750 
0.  7525 
0.  8325 


O 1825 

o525o 

0.  7375 


0 


0.  6325 
0.  7675 
0.  8050 
0.  8650 


0.  5750 
0.  6825 
0.  7675 
0.  8125 


0.  4750 
0.  6050 
0.  7200 
0.  7975 


0.  3650 
0.  5225 
0.  6125 
0.7400 


Table  1 shows  that  batch  means  always  has  greater  coverage  than  replication.  To  see  whether 
these  observed  differences  were  statistically  significant,  we  performed  a paired-t  test  for  each  n 
and  k.  Each  test  was  based  on  a sequence  of  400  pairs  of  numbers,  with  each  number  in  each  pair 
being  a 1 or  0,  depending  on  whether  or  not  the  c.i.  covered  d.  For  15  out  of  16  tests,  the  differences 
were  significant  at  least  at  the  99%  level;  in  the  other  case,  the  observed  level  of  significance 
was  95%. 

Notice  from  Table  1 that  if  m is  held  fixed  and  k (and  thus  n)  is  increased  (perhaps  in  an  effort 
to  get  a shorter  c.i.),  then  the  coverage  for  replication  may  be  drastically  reduced.  (See,  for  example, 
the  coverages  on  the  main  diagonal.)  This  degradation  occurs  because  EiX{k,  m)]  = n{in)  for  all  k, 
but  <d\X{fe,  m)]  = ff’[X<(m)]/i(:.  Thus,  as  ifc  is  increased,  a shorter  c.i.  is  constructed  around  m(*w)?^Mi 
resulting  in  a decrease  in  coverage. 

To  determine  the  generality  of  the  above  results,  we  also  simulated  the  M/M/1  queue  with 
p=0.5  and  0.7,  the  M/M/2  queue  with  p=0.9,  and  the  M/M/l/M/1  queue  (the  output  of  an  M/M/1 
queue  is  the  input  to  another  single-server  queue  with  exponential  service)  with  p=0.9  for  each 
server.  In  each  case,  we  obtained  results  similar  to  those  above ; in  particular,  we  foimd  batch  means 
superior  to  replication. 

B.  («,  S)  Inventory  System 

The  second  type  of  stochastic  system  we  considered  was  an  («,  S)  inventory  system  with  zero 
delivery  lag  and  backlogging.  Let  Xt,  Y t,  and  denote,  respectively,  the  amount  of  inventory  on 
hand  before  ordering,  the  amount  of  inventory  on  hand  after  ordering,  and  the  demand,  each  in 
period  i.  If  Xt<^»,  then  we  order  S—Xi  items  {Yi=S)  and  incur  an  ordering  cost  K-\-c-{S—Xi). 
If  Xi>a,  then  no  order  is  placed  (F<=A'()  and  no  ordering  cost  is  incurred.  After  has  been 
determined,  then  the  demand  Qt  occurs.  If  Q|>0,  then  a holding  cost&-(F<— Q<)  is  incurred. 
If  F,— Qi<0,  then  a shortage  cost  r-(Qi—  F,)  is  incurred.  In  either  case,  A^,+,=aF«  — Q|.  For  further 
discussion  of  this  inventory  system  see  Wagner  [13],  p.  Al9. 

For  our  study,  we  let  Q,  be  a Poisson  r.v.  with  mean  25,  «=  17,  and  S=57  (these  values,  which 
are  approximately  optimal,  were  computed  from  a normal  approximation  given  in  [13],  p.  A40), 
Xi=S,  K=32,  e=3,  h=l,  w=5,  and  then  simulated  the  stochastic  process  {E<,  i^l},  where  E,  is 
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the  total  cost  (expenditure)  in  period  i.  Our  objective  was  to  construct  90%  c.i.’s  for  the  steady- 
state  mean  cost  per  period,  e=  112.108.  Once  again,  we  performed  400  independent  experiments  and 
let  k=5,  10,  20,  40;  however,  now  we  let  n=320,  640,  1280,  2560.  In  Tables  2 and  3 we  give  the 
proportion  of  the  400  c.i.’s  which  covered  e and  their  average  half  length,  respectively.  It  is  not 
possible  to  give  a meaningful  graph  of  average  half  length  vs  proportion  of  coverage  since  the 
coverages  for  batch  means  are  all  over  0.90.  (See  Subsection  4.B  for  an  explanation.)  However,  it  is 
clear  from  Table  2 that  in  terms  of  coverage,  batch  means  is  far  superior  to  replication;  in  fact,  the 
observed  differences  are  significant  at  least  at  the  99%  level  for  each  n and  k.  We  feel  that  for  most 
purposes,  these  large  increases  in  coverage  more  than  justify  the  small  increases  in  half  length 
observed  in  Table  3. 

4.  ANALYSIS  OF  ERRORS 


We  have  seen  that  the  actual  coverages  of  c.i.’s  produced  by  batch  means  and  replication 
may  be  considerably  different  from  those  desired.  In  the  following  three  subsections,  we  use  the 


TABLE  2.  Proportion  oj  Coverage  of  Various  90%  Confidence  Intervals  for  e=t  12.108  in  an  («,  S) 
Inventory  System  with  8=17,  S=67,  and  Poisson  Demand. 


TABLE  3.  Average  Half  Length  of  Various  90%  Confidence  Intervals  for  e=112.108  in  an  (s,  S) 
Inventory  System  with  8=17,  S=57,  and  Poisson  Demand. 
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Af/M/l  queue  to  determine  how  the  potential  sources  of  error  mentioned  in  Section  2 affect  the 
two  methods.  We  will  explicitly  mention  the  above  inventory  system  only  when  a source  of  error 
affects  it  differently  from  the  manner  in  which  it  affects  the  M/M/1  queue.  Our  ultimate  objective 
is,  of  course,  to  learn  how  best  to  employ  the  two  methods. 


A.  P(^t  Estimator  Bias 

A requirement  of  both  methods  is  that  the  Xt(m)’s  be  identically  distributed  with  mean  m- 
However,  for  most  real-world  simulations  this  requirement  is  not  met,  and  causes  a diflSculty  in 
estimating  m which  we  called  point  estimator  bias.  LetZ>(/)  be  the  sample  mean  of  Di,  A,  . ■ -,  7>|. 
Wo  can  graphically  see  the  point  estimator  bias  for  the  M/M/i  queue  in  Figure  2 where  we  plot 
^Z>(f)|£>i=0]  (see  Heathcote  and  Winer  [9])  as  a function  of  1.  Note  that  E[D(l)\Di=0\^d  for 
any  I,  but  approaches  it  as  / increases  (as  it  should  by  the  definition  of  d). 

To  determine  the  degradation  in  coverage  caused  by  the  point  estimator  bias,  we  simulated 
the  stationary  M/M/1  queue;  that  is,  an  M/M/1  queue  where  the  number  of  customers  found  by  the 
“first”  arrival  is  a r.v.  with  the  stationary  number  in  system  distribution  (see  Gross  and  Harris 
8],  p.  47).  (In  this  case,  it  is  easy  to  show  that  Z?i  has  the  stationary  delay  in  queue  distribution. ) 
Thus,  {Dt,  i>l}  is  a stationary  stochastic  process,  E[^{[)]=d  for  each  /,  and  there  is  no  point 
estimator  bias.  We  performed  400  independent  simulation  experiments  using  the  same  values  of  n 
and  k and  the  same  stream  of  random  numbers  as  we  did  for  the  usual  MIMjl  queue.  Table  4 
gives,  for  each  n and  k and  each  method,  the  proportion  of  the  400  c.i.’s  which  contained  d.  A 
comparison  of  Tables  1 and  4 shows  that  coverage  in  the  stationary  case  is  greatly  improved  for 
replication  but  only  slightly  improved  for  batch  means.  This  is  not  surprising  since,  for  example, 
£125(320)|Di=0]=6.01  but  ElS(1600)lZ>, =01=7.59  (see  Figure  2).  We  conclude  that  for  the 
values  of  n considered,  point  estimator  bias  is  a nr  ajor  source  of  degradation  for  replication  but  has 
little  effect  on  batch  means. 


TABLE  4.  Proportion  of  Coverage  of  Various  90%  Confidence  Intervals  for  d=8.1  in  a Stationary 

MIMjl  Queue  with  p=0.9. 


Replication 

Batch  Means 

\ * 
n 

5 

10 

20 

40 

5 

10 

20 

40 

1600 

0.  7800 

0 . 8100 

0.  8525 

0.  8925 

0.  6875 

0.  6300 

0.  5350 

0.  4050 

3200 

0.  8150 

0.  8200 

0.  8450 

0.  8975 

0.  7925 

0.  7075 

0.  6275 
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FIGURE  2.  E(S(2)|Pi  =0]  as  a function  of  I for  the  MjMjX  queue  with  p =0.9. 

B.  Correlation  Between  the  X|(m)’8 

Suppose  that  {Xi,  t>l ) is  a (covariance)  stationary  stochastic  process.  For  j=0,  1,  2, . . ., 
let 

Cf=Coy  {Xt,  Xt+j)=c-i, 

C'/(m)=Cov  [X,(ni),  X,+^(m)] 

{Xi{m)  is  the  sample  mean  of  . . .,  X„,  for  i=l,  2,  . . .),  and 

p,(m)=Cj(m)/Co(m). 

Then  it  is  easy  to  show  that  (see  Mechanic  and  McKay  [11]) 

Ci{m)=  2 (l-|t|/TO)C'^+^^m. 

A fundamental  assumption  of  the  method  of  batch  means  is  that  m is  large  enough  so  t hat  the 
X|(m)’<  are  approximately  uncorrelated,  i.e.,  P/(m)  =0  lorj ^0.  Let  us  examine  the  effect  of  violating 
this  assumption.  The  true  variance  of  X{k,  m)  is  given  by  (see  Fishman  [7]) 

(4)  o*{X{k,  TO)]=C,(m)  [^1+2  2 (l-i/*)p/(m)J/*. 
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Denote  the  quantity  in  brackets  by  a{k,  m).  It  can  be  shown  that  (see,  for  example,  Anderson  [1], 
p.448) 

(5)  m)]}=[fc-a(A:,  m)]C,{m)lk{)c-\), 

where  m)]  was  given  by  (2).  Combining  (4)  and  (5)  we  get 

£{^[1  (It,  m)]}=5(A;,  (it,  m)], 

where  h{Jc,  m)—  |[A:/a(A;,  m)]  — 1 l/(fc— 1).  Note  that  b(k,  m)  = l when  a“[X(i,  m)\  is  an  unbiased  esti- 
mator. 

Consider  the  sequence  {D(,  i>  1 j for  the  stationary  M/M/l  queue.  We  can  compute  Cj  (which 
is  positive)  and,  thus,  p>(m)  and  h{lc,m)  from  formulas  given  by  Daley  [5).  Table  5 
gives  pj(m)(j=l,  2, 3)  and  b(k,m)  for  the  values  of  k and  m previously  considered.  Observe  that 
a^[XiJc,m)\  has  anegative  bias  (6(Jt,m)<l)  when  the  Pjimys  are  positive.  To  determine  how  large 
a degradation  in  coverage  is  caused  by  the  bias,  we  repeated  the  simulation  experiments  of  Sub- 
section 4. A using  the  same  random  nmnbers.  However,  for  the  current  experiments  we  divided 
each  of  the  400  variance  estimates  a^{X{k,m)]  by  the  constant  b{k,m).  Table  6 gi^es  the  resulting 
coverages.  We  conclude  from  a comparison  of  Tables  4 and  6 that  the  bias  in  v*[jK(ifc,  m)]  can  be  a 
major  source  of  error  for  batch  means. 

Consider  now  the  sequence  {£<,  t>  1}  for  the  stationary  version  of  the  above  inventory  system 
(i.e.,  Xi  has  the  stationary  number  in  system  distribution  as  given  in  [13],  p.  A48).  We  can  compute 
Cj,  which  can  be  positive  or  negative,  in  a straightforward  but  laborious  manner  and  we  give 
Pi(wi)(i=li  2,  3)  and  b{k,  m)  in  Table  7.  Notice  that  the  py(m)’s  are  now  negative  and  that$[X(A;,m)] 
has  a positive  bias  Q){k,  m)]>l).  This  explains  why  the  coverages  (for  batch  means)  in  Table  2 are 
larger  than  0.90.  However,  we  believe  that  coverage  greater  than  expected  is  not  nearly  as  undesir- 
able as  coverage  less  than  expected.  Thus,  correlation  between  the  X,(m)’s  would  seem  to  be  par- 
ticularly troublesome  only  when  it  is  positive. 


TABLE  5.  Pj{fn)(j=l,  2,  S)  and  b{k,  m)  for  the  Stationary  MjMjl  Queue  with  p=0.9. 


m 

p,(m) 

pj(m) 

Pzim) 

6(5,  m) 

6(10,  m) 

6(20,  m) 

6(40,  m) 

■■ 

0.  830 

0.  645 

0.  514 

mm 

0.  093 

0.  461 

0.  169 

0.  174 

0.  567 

0.  258 

0.  129 

0.  300 

0.  306 

0.  387 

0.  460 

0.  480 

0.  484 

0.  219 

0.  661 

0.  673 

1280 

0.  819 

2660 

0.048 

0.910 

Bli 

L ^ 
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TABLE  6.  Proportion  of  Coverage  of  Various  90%  Confidence  Intervals  for  d=8.1.  in  a Stationary 
MIM/1  Queue  with  p=0.9  and  Variance  Estimate  m)]/b(k,  m). 


Batch  Means 

5 

10 

20 

40 

0.  8175 

0.  8225 

0.  8450 

0.  8575 

0.  8275 

0.  8500 

0.  8525 

0.  8600 

0.8400 

0.  8525 

0.  8725 

0.8700 

0.  8725 

0.  8575 

0.  8600 

0.8850 

TABLE  7.  S)  and  b(k,  m)for  the  Stationary  (s,  S)  Inventory  System  with  s=17,  S=67, 

and  Poisson  Demand. 


0.  124 
0.071 
0.038 
0.020 


-0.006 

-0.000 

-0.000 

-0.000 

-0.000 


-0.000 

-0.000 

-0.000 

-0.000 

-0.000 


6(20,  m) 

6(40,  m) 

2.004 

1.615 

1.324 

1. 164 

2.444 

2.  043 
1.626 
1.327 

C.  Nonnormality  of  the 

We  can  determine  the  effect  of  nonnormality  on  replication  from  Table  4,  since  the  effect  of 
the  other  potential  source  of  error,  point  estimator  bias,  has  been  removed.  Similarly,  we  can  deter- 
mine the  effect  of  nonnormality  on  batch  means  from  Table  6,  since  the  effects  of  the  other  two 
sources  of  error,  point  estimator  bias  and  the  correlation  between  the  7<(m)’s,  have  been  removed. 
We  conclude  that  for  the  models  considered  here,  nonnormaiity  is  not  a major  source  of  error  for 
either  method,  especially  if  it  is  approximately  20  or  greater. 

5.  CONCLUSIONS  AND  FUTURE  RESEARCH 

In  this  paper  we  empirically  showed  that  batch  means  is  superior  to  replication  for  several 
well-known  queueing  and  inventory  systems.  Although  we  cannot  conclude  from  these  results  that 
batch  means  is  superior  for  every  stochastic  system,  we  believe  that  it  is  a wise  choice  unless  some 
additional  information  about  the  system  being  simulated  is  available.  This  and  the  other  major 
conclusions  of  this  paper  are  sununarized  below: 
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(a)  The  major  source  of  error  for  replication  is  point  estimator  bias,  i.e., 

(b)  If  m is  chosen  too  small  for  replication,  then  coverage  may  decrease  as  Jb  is  increased. 

(c)  The  major  source  of  error  for  batch  means  is  the  bias  in  m)]  which  is  caused 

by  the  correlation  between  the  7<(m)’s. 

(d)  Batch  means  appears  to  be  superior  to  replication. 

(e)  If  n is  chosen  too  small,  then  the  actual  coverage  of  either  method  may  be  considerably 
lower  than  that  desired,  regardless  of  the  choice  of  k. 

(f)  If  it  is  possible  to  increase  n for  either  method,  then  it  is  preferable  (in  terms  of 
coverage)  to  hold  k fixed  and  increase  m (see  Tables  1 and  2). 

In  order  for  replication  to  be  a viable  method  for  interval  estimation,  a procedure  is  needed  for 
dealing  with  point  estimator  bias.  To  the  best  of  our  knowledge,  no  such  generally  applicable 
procedure  exists  at  the  present  time.  However,  it  should  be  mentioned  that  research  related  to  this 
problem  is  currently  being  done  by  Ancker  and  Gafarian  at  the  University  of  Southern  California. 

We  have  not  explicitly  discussed  how  to  choose  k and  m for  batch  means.  However,  from  Table 
1 we  know  that  for  some  systems  and  some  values  of  n there  will  be  no  value  of  k which  will  produce 
coverage  close  to  the  desired  level.  We  believe  that  what  is  needed  is  a sequential  procedure  which 
fixes  it  at  a “reasonable”  value  and  then  successively  increases  m (and  thus  n)  until  the  X((m)’s 
are  approximately  uncorrelated.  We  will  present  such  a procedure  in  the  next  paper  in  this  series. 
Empirical  results  which  will  be  reported  there  indicate  that  this  sequential  procedure  works  quite 
well  for  a variety  of  stochastic  models.  Thus,  if  properly  implemented,  batch  means  can  be  a viable 
method  of  interval  estimation.  For  other  research  related  to  the  choice  of  k and  m,  see  Refs.  [7] 
and  [11]. 
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ABSTRACT 


The  purpose  of  this  paper  and  the  accompanying  tables  is  to  facilitate  the 
calculation  of  constrained  optimum  order  quantities  and  reorder  points  for  an 
inventory  control  system  where  the  criterion  of  optimality  is  the  minimization  of 
expected  inventory  holding,  ordering,  and  time-weighted  backorder  costs.  The 
tables  provided  in  the  paper  allow  the  identification  of  the  optimal  solution  when 
order  quantities  and/or  reorder  points  are  restricted  to  a set  of  values  which  do  not 
include  the  unconstrained  optimal  solution. 


1.  INTRODUCTION 

There  are  many  situations  in  which  the  analyst  is  forced  to  choose  values  of  order  points  and 
order  quantities  from  a finite  set  of  alternatives.  If  stock  can  be  ordered  in  multiples  of  say,  m 
units,  it  would  be  under  the  most  rare  circumstances  that  an  unconstrained  optimal  solution 
(UOS)  would  be  equal  to  one  of  the  admissible  lot  sizes.  Similarly,  it  may  not  be  possible  to  measure 
the  inventory  level  at  any  other  than,  say,  n different  levels.  For  example,  it  may  not  be  economical 
to  measure  accurately  the  exact  level  of  content  in  pressurized  gas  containers.  Thus,  for  practical 
purposes,  the  reorder  point  will  be  an  integer  number  of  containers.  The  tables  in  this  paper  allow 
the  identification  of  the  optimal  solution  when  order  point  and  order  quantities  are  constrained. 

The  continuous  review  inventory  system  envisioned  here  is  the  same  as  the  one  examined  by 
Holt,  Modigliani,  Muth,  and  Simon  (HMMS)  (Ref  [4],  p.  226).  The  assumptions  concerning  the 
inventory  system  follow. 

• The  lead  time  is  shorter  than  the  time  between  orders,  the  so-called  “lot  time.” 

• The  order  point  or  trigger  level,  T,  is  nonnegative. 

• The  lead  time  is  constant  and  known. 

• There  is  no  serial  correlation  of  sales  rates  between  periods. 

• An  order  not  satisfied  immediately  from  inventory  is  backlogged. 

• The  backorder  penalty  is  a function  of  the  time  duration  and  amount  of  backorders. 

The  above  assumptions  allow  the  development  of  a total  cost  function  which  HMMS  identify 

as  Model  Two  (Ref.  [3],  Eq.  12-23).  The  cost  function  is  equivalent  to 
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(1)  K(Q,  r)=(7,|+c’,(|+r-s^)+^  (C,+Cc)£°  ns^)ds^, 

where  Cr  is  the  ordering  cost, 

Cj  is  the  holding  cost/unit/year, 

Cd  is  the  backorder  cost/unit/year, 

S is  the  expected  annual  sales, 

Sz,  is  the  expected  sales  over  the  lead  time, 

/(St)  is  the  probability  density  of  St, 

Q is  the  order  quantity,  and 
T is  the  trigger  level  or  reorder  point. 

Unfortunately,  the  values  of  Q and  T which  minimize  K(Q,T)  are  not  easy  to  identify  since,  as 
HMMS  express  it,  “the  integral  above  is  difficult  to  evaluate  for  many  density  hmctions  of  in- 
terest, ...”  in  reference  to  the  integral  in  (1). 

When  demand  over  lead  time  is  normally  distributed  we  may,  with  the  aid  of  the  accompanying 
table,  evaluate  the  integral  in  (1)  and  employ  a procedure  presented  in  the  paper  to  search  for  the 
optimal  values  (constrained  or  otherwise)  of  Q and  T.  Furthermore,  the  tables  allow  the  K(Q,T) 
cost  surface  to  be  easily  generated. 

2.  COMMENTS  ON  RELATED  PAPERS 

The  mathematical  approach  employed  by  Galliher,  Morse,  and  Simond  (GMS)  [3]  to  obtain 
a total  cost  function  is  different  from  the  one  employed  by  HMMS.  Their  method  of  arriving  at 
optimal  values  of  Q and  T is  based  upon  an  approximation  to  the  cost  function  they  derive.  Deemer 
and  Hoekstra  [2]  have  developed  tables  which  identify  the  optimal  values  of  Q and  T for  the 
OMS  model,  but  the  tables  do  not  facilitate  cost  evaluation  or  aid  in  the  search  for  constrained 
optimal  Q,  T strategies.  Koenigsberg  [6]  uses  a model  similar  to  that  of  GMS,  but  adopts  a method 
which  he  states'  to  be  equivalent  to  minimizing  holding  and  ordering  costs  subject  to  a fixed  pro- 
tection against  shortage.  Backorders  are  not  time-weighted.  Thatcher  [8]  uses  a model  in  which 
stockholding  costs  are  proportioned  to  the  maximum  amount  of  stock,  which  seems  a doubtful 
approximation  for  many  applications.  Buckland  [1]  uses  a nomogram  to  simplify  the  joint  calcula- 
tion of  Q and  I,.  Unfortunately,  the  construction  of  the  nomogram  is  left  to  the  reader.  Back- 
orders were  not  time-weighted  in  Buckland’s  treatment.  Lampkin  and  Flowerdew  [7]  present  an 
iterative  procedure  for  the  optimization  of  a related  cost  function,  but  require  the  generation  of  a 
table  of  values  to  be  used  in  the  optimization  procedure. 

Herron  [4]  generated  a series  of  graphs  suitable  for  identifying  the  UOS  for  the  GMS  model. 
The  graphs  in  the  Herron  paper  may  be  employed  to  assess  the  sensitivity  of  the  minimum  cost 
solution  to  changes  in  demand  imcertainty  over  lead  time.  However,  they  cannot  be  used  in  the 
identification  of  a constrained  optimal  solution  (COS),  nor  any  sensitivity  in  cost  to  changes 
in  the  Q,T  strategy  from  the  UOS. 

The  paper  proceeds  in  several  sections.  Section  3 displays  the  derivatives  of  K{Q,  T)  and  Sec- 
tion 4 describes  the  cost  surface  K(Q,  T).  Section  5 describes  properties  of  the  isocost  rings  and  the 
UOS.  Section  6 discusses  t}q>es  of  constraints  that  may  be  incurred  and  procedures  to  identify 
the  COS.  Section  7 illustrates  the  use  of  the  tables  and  presents  several  examples.  The  tables  and 
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optimization  procedure  presented  here  allow  the  constrained  optimal  solution  and  unconstrained 
optimal  solution  to  be  identified  in  several  minutes  of  computations  manually,  or  in  seconds  by 
computer.  The  tables  allow  the  calculation  of  Q and  T to  within  one-tenth  of  one  standard  deviation 
of  demand  over  lead  time. 

S.  PARTIAL  DERIVATIVES  OF  K(Q,  T) 

Denote  as  Dqq  and  Drr  the  second  partial  derivatives  of  K(Q,  T)  with  respect  to  Q and  T, 
respectively.  From  (1),  then,  we  have 

(3)  JJ* 

Note  that  Dqq  and  Dtt  ere  greater  than  zero  for  all  values  of  i^,  Cr,  Gi,  Cd,  eod  Q greater  than  zero. 
4.  THE  TOTAL  COST  SURFACE 

The  isocost  (^IC)  rings  of  the  cost  function  K{Q,  T)  over  the  Q,  T quadrant  form  nested  rings 
whose  size  diminish  as  K(Q,  T)  decreases.  The  rings  are  symmetric  about  their  major  axis  for  rea- 
sons which  will  be  discussed  later  in  this  paper.  The  major  axis  of  the  isocost  ring  is  negative  in 
slope,  and  the  major  axis  of  an  IC  ring  is  closer  to  the  Q,  T origin  the  lower  the  total  cost  value 
associated  with  the  ring.  Figure  1 shows  an  example  where  represents  total  cost  associated  with 
ringj. 

Since  D^q  and  Drr  »re  both  strictly  positive,  any  point  within  an  IC  ring  must  yield  a lower 
total  cost  value  than  any  point  on  the  IC  ring  thus  ensuring  that  the  IC  rings  are  nested.  As  the 
reader  may  expect,  the  cost  surface  becomes  relatively  fiat  near  the  optimal  solution.  Since  the  IC 
rings  are  nested  but  not  concentric,  the  rate  of  increase  in  K{Q,  T)  with  respect  to  movement 
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from  the  UOS  is  very  sensitive  to  the  direction  of  movement.  Thus,  when  it  is  not  possible  to 
implement  the  JJOS,  great  care  must  be  exercised  in  choosing  among  alternative  constrained 
solutions. 

5.  PROPERTIES  OF  THE  UNCONSTRAINED  OPTIMAL  SOLUTION 

It  will  be  shown  that  the  UOS,  under  conditions  discussed  later  in  the  paper,  will  lie  at  the 
point  of  tangency  of  one  line  of  a family  of  parallel  lines  and  a curve  convex  to  the  Q,  T origin. 

Denote  as  Q*(T)  that  value  of  Q which,  for  a given  value  of  T,  minimizes  ^(Q,  T).  An  ex- 
pression for  Q*(T)  may  bo  found  by  setting  the  first  derivative  of  (1)  with  respect  to  Q equal  to 
zero  and  solving  for  Q.  Doing  so  yields 

(4)  QriT)  = [Cx  + C,EBPiT)Y>\ 

where  Ci=2Cr^lCi,  (7j=2St((7/-|-Cc)/C'/,  and  EBP(T)  represents  the  integral  in  (1).  When  T is 
large,  EBP{T)  is  small  and  Q*{T)  approaches  the  familiar  Wilson  lot-size  formula.  From  (1),  it 
foUows  easily  that  EBP(T)  and  therefore  Q*(T  are  monotonically  decreasing  functions  of  T. 

Denote  the  UOS  to  (1)  as  Q**,  T**  and  the  optimal  cost  as  K*.  Since  Q*(T)  is  a single-valued 
function  of  T,  it  follows  that  Q**  must  lie  on  the  curve  defined  in  (4),  i.e.,  Q*{T**)  = Q**.  It  will 
be  assumed  in  the  discussion  to  follow  that  Q*iT)  is  convex,  although  the  assumption  is  not  sup- 
ported by  a proof.  Q*(T)  has  always  been  foimd  to  be  convex  for  all  values  assigned  to  Ci  and  Ct 
by  the  author.  To  ensure  that  Q*(T)  is  convex  for  a particular  problem,  the  curve  may  be  traced 
out  for  selected  values  of  T and  EBP{T)  with  the  aid  of  the  tables.  Under  the  assumption  of 
convexity  of  Q*(T),  the  optimal  solution  will  be  shown  to  be  imique. 

Consider  the  isocost  rings  over  the  Q,  T plane.  Setting  K{Q,  T)  equal  to  some  constant,  say 
1?,  and  solving  (1)  for  Q,  we  obtain 

(5)  Qi'R, 

Thus,  for  a given  value  of  and  T,  (5)  yields  the  value(8)  of  Q for  a given  value  of  T on  the  IC 
ring  associated  with  total  cost  Substituting  from  (4)  yields 

(6)  Qi'R,  T)  =L(R,  T)  ± [{-LiK,  T)y-Q*{Tyr\ 

where  

(7)  Z(S,T)=^-t-S^-r. 

L{K,  7)  is  a family  of  lines  whose  intercept  is 

and  slope  is  —l.AsR decreases,  the  intercept  grows  smaller  and  the  lines  move  toward  the  origin. * 

*Note  that  the  diameter  of  the  isocoet  ring  in  the  K (Q,  T),  T plane  for  a given  value  of  T is  related  to  the 
amount  by  which  T))*  exceeds  Q*(T)\  As  one  goes  to  lower  values  of  K(Q,  T)  holding  Tconstant,  —L(K,  T) 

approaches  Q*{T),  thus  ensuring  that  the  diameter  is  diminishing.  Since  Dqq  is  positive  for  all  Q and  T,  it  follows 
that  Uie  IC  rings  are  nested,  since  all  points  within  (outside)  the  ring  yield  cost  values  less  (more)  than  all  points 
on  the  ring. 
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In  the  upper  frame  of  Figure  2,  three  members  of  the  family  of  lines  Q=L(K,  T)  are  drawn, 
with  L(K*,  T)  denoting  the  line  associated  with  the  minimum  value  of  K{Q,  T).  For  values  of  T 
such  that  T)  exceeds  Q*{T),  it  follows  from  (6),  as  shown  in  Frame  a of  Figure  2,  that  the 
isocost  ring  has  two  values  of  Q{^,  T).  Thus  it  follows  from  (6)  that  for  values  of  It.  and  7 such  that 

(8)  L{K,  T)'>Q*{T),  Q{K,  D has  two  real  solutions, 

(9)  L(R,  T)=Q*(T),  Q(R,  T)  has  one  real  solution, 

(10)  L{K,  T)<^Q*{T),  Q{R,  T)  has  no  real  solution. 

If,  for  a given  T,  say  7*,  Q{R,  7»)  has  two  real  solutions,  say  Q\  and  Q»,  then  by  definition  of  the 
ICring,  _ 

(11)  K(«„  7.)=Jf(Q.,  7»)=K. 

But  since  it  follows  that  there  must  be  a X,  0<X<1,  such  that 

(12)  K{Q„  TuXR 

where  <ik=XQi+(l  Thus,  R cannot  bo  optimum  if  for  some  7 it  is  found  that  L(R,  7)> 

C*(7).  The  value  of  R which  corresponds  to  the  line  tangent  to  Q*{T)  yields  one  value  of  Q(R,T) 
for  one  value  of  7 and  imaginaiy  values  of  Q(R,  7)  for  all  other  values  of  7.  Since  all  lines  above 
the  tangent  line  lie  above  Q*iT)  for  some  value  of  7 and  therefore  yield  two  values  of  Q{R,  7), 
those  lines  must  be  associated  with  a value  of  R greater  than  K*.  All  lines  below  the  line  of  tangency 
do  not  intersect  Q*(T)  and  therefore  do  not  yield  real  valued  solutions.  For  Q*{T)  convex,  one  and 
only  one  line  in  the  family  of  LiR^  7)  lines  will  be  tangent  to  Q*iT).  Therefore,  the  point  of  tan- 
gency between  that  line  and  the  Q*(T)  must  be  the  optimal  solution. 

With  the  aid  of  the  tables,  the  convexity  of  Q*(T)  for  a given  problem  may  be  investigated 
and  the  UOS  easUy  identified.  An  iterative  procedure  which  will  identify  the  unconstrained  opti- 
mal solution  in  seven  iterations  is  provided  in  the  appendix. 
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J*  P«  0 


CASE  1 


FEASIBLE 
Q.T  VALUES 


CASE  2 


FEASIBLE 
0,T  VALUES 


CASES 


CASE  4 


CASES 
FEASIBLE 
Q,T  VALUES 


6.  FORMS  OF  CONSTRAINTS  ON  Q AND  T 

CASE  1:  Qy’Qmbi  or  Q<iQmi  where  and  Tis  unconstrained. 

Figure  3 Frame  (a)  displays  an  example  of  the  situation.  Since  Dtt  is  strictly  positive,  the 
value  of  T which  minimizes  K{Q,  T)  for  a given  value  of  Q is  unique.  Since  both  Dqq  and  Dtt  are 
strictly  positive  and  the  IC  rings  are  nested,  it  follows  that  if  Qmia^Q**,  there  is  no  Q greater 
than  ^o,in  which  will  yield  a cost  less  than  Similar  reasoning  holds  for  Q**'>Q^.. 

and  K(QaMj,  r*(<2in»i)).  The  COS  is  found  by  calculating  K(Q,  T)  with  the  aid  of  the  tables  and 
evaluating  successive  values  of  T in  the  direction  of  decreasing  values  of  KiQata,  T).  Clearly,  as 
one  moves  toward  point  a(6)  along  line  Q=Qaia  (Qmtz),  the  cost  function  will  decrease  until  the 
COS  is  passed.  Thus,  incrementing  T by  0.1  unit  from  S/,  -3.0  units  and  evaluating  K(Q,  T)  until 
an  increase  in  K(Q,  T)  is  found  will  identify  the  COS. 

CASE  2:  T^Tatn  or  Tc^T^x  where  Ta\a>T**>Taxx  and  Qis  unconstrained. 

Refer  to  Frame  (b)  of  Figure  3.  The  COS  is  found  directly  by  substituting  the  value  of  T 
min  (or  T max)  in  (4).  By  simUar  arguments  to  those  employed  in  Case  1,  the  value  of  Q which 
minimized  K(Q,  T)  for  a given  T is  unique. 

CASE  3:  Qt,  0*,  ■ > .,  Qk],  T la  unconstrained.  Assume  Qj<Qh.i,  i=l,  . . .,  fc-1. 
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Refer  to  Frame  (c)  of  Figure  3.  It  is  apparent  from  Figure  3 that  as  successive  values  of 
KijCli,  T*(Qj))  are  evaluated,  the  first  nondecreasing  value  of  K{Qj,  T*{Q}))  indicates  that  the 
optimal  constrained  solution  has  been  passed.  Starting  with  j=l,  calculate  KiQ/,  T*(Q,))  as  in 
Case  1.  Continue  to  increment  j until  for  some  w, 

Since  the  isocost  rings  are  nested  and  Dqq  and  Dtt  are  strictiy  positive,  it  follows  that  Q^, 
is  the  COS. 

CASE  4:  TtT={Tu  T,,  T,,  . . .,  Til,  Q is  unconstrained. 

Refer  to  Frame  (d)  of  Figure  3.  Assume  r^<r>+i,j=l,  . . .,  I — 1.  It  is  apparent  from  Figure  3 
that  as  successive  values  of  K{Q*(_Tf),  2/)  are  evaluated,  the  first  nondecreasing  value  of  K{Q*{Tf), 
T])  indicates  that  the  optimal  solution  has  been  passed.  Starting  with  j = l calculate  K{Q*{Tj),  T,) 
as  in  Case  2.  Continue  to  increment  j and  calculate  K{Q*(Tj),  T,)  imtil  for  some  V, 

K{Q*{Ty.,),  Ty-l)>K{Q*(Ty),  Ty)<K{Q*)Ty^{),  Ty^C. 

If  we  employ  arguments  similar  to  Case  3,  it  follows  that  Q*(Ty),  Ty  is  the  COS. 

CASE  5:  Qt^,  Ttr. 

Refer  to  Frame  (c)  of  Figure  3.  If  the  number  of  feasible  Q,  T strategies  is  small,  each  of  the 
iXt  points  may  be  evaluated.  If  the  number  is  large,  the  cost  surface  may  be  generated  to  visually 
locate  the  COS.  Figure  4 indicates  the  flow  of  the  computations  which  will  generate  the  Kify,  T) 
surface  from  to  P-...  and  for  T from  El  —3  to  ^t+3. 

7.  USE  OF  THE  TABLES 

In  order  to  bo  applicable  to  a specific  problem,  the  units  of  measure  must  be  standardized; 
therefore,  all  measurements  are  in  terms  of  standard  deviations.  Thus,  an  annual  sales  rate  of 
1000  units,  a lead  time  of  0.08  year,  an  order  point  of  90,  and  a standard  deviation  of  10  imits 
would  yield  parameters  as  follows: 

5=100,  5t=8,  and  7=9 


The  CV,  Cl,  and  Co  would  bo  in  dimensions  of  $/order,  S/vyear,  and  S/v-year,  respectively. 
Thus,  for  a 7 of  9 and  Sx.  of  8,  the  corresponding  time-weighted  value  is  ERP(7) =0.003700.  The 
interpretation  is  that  for  every  order  cycle  we  expect  on  the  average  to  accumulate 


o.( 


or  1.07  unit  days  of  backorders. 

Continuing,  let  us  assume  that  the  cost  parameters  of  the  problem  are  as  follows:  <7/=S100/<r 
year,  C',=$200,  (7i>=$40,000/<r  year.  We  obtain  C',=400  and  C',=6416.  If  the  procedure  presented 
in  the  appendix  were  employed,  the  UOS  would  be  found  to  be 

P**=20.465  and  7**=9.1. 


(13) 


*1116  evaluation  of  441  points  on  the  K(,Q,  T)  surface  has  been  found  to  require  less  than  1 second  of  CPU  time 
on  a Univac  1110. 


Other,  more  complex  constraining  relationships  would  best  be  examined  by  observing  the  location  of  feasible 
Q,  T values  on  the  K(Q,  T)  surface. 


Table  1 displays  the  results  of  applying  the  techniques  discussed  in  Section  6 in  solving  the 
problem  above  and  constraining  the  solution  by  several  example  methods.  The  values  in  the  Q 
and  T columns  are  cost-minimizing  values  unless  otherwise  constrained.  For  example,  for  Case  1, 
a trigger  level  of  8.8  will  minimize  K{Q,T),  given  Qmust  not  be  less  that  40  and  Tis  unconstrained. 

Note  that  if  the  values  of  Q and  T were  rounded  down  from  the  UOS  to  (for  Case  5)  the  next 
I smallest  feasible  values  of  Q=18  and  7=8,  the  resulting  expected  cost  of  that  strategy  would  be 

12%  higher  than  the  COS  of  Q—22  and  7=10.  Roimding  Q up  to  22  and  7 down  to  8 is  slightly 
, better,  costing  8%  more  than  the  COS.  It  is  clear  that  simply  roimding  the  UOS  values  up  or 

' down  is  not  necessarily  going  to  yield  a very  satisfactory  solution  to  the  constrained  problem. 

i 

i 
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TABLE  1. 


Values 

Case 

Constraint 

COS? 

Cost 

Q 

H 

1. 

Q>40,  T unconstrained 

40.0 

8.8 

yes 

2626.  30 

2. 

T>10.5,  Q unconstrained 

20.0 

10.5 

yes 

2250.00 

3. 

Q*«={18,  22,  26} 

M 

9.2 

no 

2172.  33 

T unconstrained 

mi 

9.  1 

yes 

2162.  90 

m 

9.0 

no 

2214.  88 

4. 

Ttr={<o,  8,  10} 

47.3 

6.0 

no 

4530.  00 

Q unconstrained 

23.8 

8.0 

no 

2380.  00 

20.0 

10.0 

yes 

2200.00 

5. 

Qti>,  Ttr 

18.0 

6.0 

no 

6914.  91 

18.0 

8.0 

no 

2477.  67 

18.0 

no 

2215.  81 

22.0 

6.0 

no 

5984.  92 

22.0 

8.0 

no 

2390.  83 

22.0 

2212.  94 

26.0 

6.0 

no 

5402.  63 

26.0 

8.0 

no 

2392.  24 

26.0 

no 

2272.  49 

i 

j 

■i 


i 


1 


1 

1 
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APPENDIX 

Search  Procedure  for  T** 

It  follows  from  (10)  that  as  H increases,  Lt^,  T)  moves  outward  from  the  origin.  For  Q*(T) 
convex,  the  points  of  intersection  of  a line  L(R,  T)  and  the  curve  Q*{T)  move  apart  and  away  from 
r**  as  R increases.  Thus,  if  an  increase  in  T from,  say,  T„  to  Ti  along  the  curve  Q*(T)  results  in  an 
increase  such  that  K(Q*(Tt),Ti)'^KiQ*iT„),T^),  then  all  points  beyond  Ti  may  be  eliminated 
from  consideration  in  the  search  for  T**.  Conversely,- if  a decrease  in  T from,  say,  T,  to  T,  results 
in  an  increase  such  that  K{Q*{T,),T,)'^K{Q*{T,),T,),  then  all  points  beyond  T,  may  be  eliminated 
from  consideration.  The  search  procedure  makes  use  of  these  observations. 


For  each  value  of  Si,  there  are  61  tabled  values  of  T— 5*.  The  search  procedure  will  be  used  to 
identify  the  tabled  value  of  T and  Q*(T)  which  minimizes  (1).  Let  n,  1 <n<61  denote  a row  of  the 
table.  Let  7’y=Si  — 3.1  + (0.1)  N)  where  Nj  denotes  a value  of  n.  Let  K)=K{Q*{Tj),  T))  where 
Q*(Tf)  is  determined  in  (4). 

The  search  procedure  at  each  iteration  eliminates  from  further  consideration  sets  of  values  of 
n.  At  the  start  of  each  iteration,  the  uneliminated  values  of  n are  divided  into  three  mutually 
exclusive  sets.  Given  Ni  and  Nt,  sets  St,  St,  and  St  are  formed. 

If  Ni<^Nt,  St  contains  values  of  n<Nt, 

Si  contains  values  of  n>Nt, 

St  contains  values  of  n>iVi  and  <.lVj. 

If  Nt'^Nt,  St  contains  values  of  n>A^i, 

Si  contains  values  of  n<Ni, 

St  contains  values  of  n'^Nt  and  <iVi. 

The  procedures  may  now  be  presented. 

STEP  1.  Set  JV,=24,  JV,=38. 

STEP  2.  Form  sets  St,  St,  and  St  <ts  indicated  above. 

STEP  3.  Calculate  Kt  and  Kt. 

STEP  4.  If  Ki'^Ki,  eliminate  set  Si  from  further  consideration  and  set  Ni  equal  to  Nt-  If 
Kt<^Ki,  eliminate  set  Sj  from  further  consideration. 

Let  the  larger  of  the  two  remaining  sets  be  denoted  as  So. 


TIME-WEIOHTBD  BACKOBDER  PENALTIES  689 

ITERATION  1.  If  4^  n«iSo,  n^N,  then  A^»=A^i-|-10,  otherwise  Nt=Nt — 10.  Perform  Steps 
2 through  4 and  proceed  to  Iteration  2. 

ITERATION  ».  If  nfS„,  n>Nu  then  iV,=M+4, 
through  4 and  proceed  to  Iteration  3. 

ITERATION  S.  If  ntSo,  n>Ni,  then  N»=Ni+6, 
through  4 and  proceed  to  Iteration  4. 

ITERATION  4.  If  -P  n«5i,  n>JV'„  then  iV,=iV,+2, 
through  4 and  proceed  to  Iteration  5. 

ITERATION  6.  If  -P  n«So,  n>iV„  then  iV,=iV,+2, 
through  4 and  proceed  to  Iteration  6. 

ITERATION  6.  There  are  3 values  that  remain,  one  on  each  side  of  the  present  value  of  N. 
Set  iVj  equal  toiVi+1.  Let  Si  consist  only  of  the  ATj.  S*  is  null.  St  consists  of  the  remaining  two  values. 
Perform  Steps  3 and  4 end  proceed  to  Iteration  7. 

ITERATION  7.  If  So  contains  one  value  of  n,  go  to  Step  5.  It  So  contains  2 values,  let  No= 
Ni—l.  Perform  Steps  3 and  4 and  proceed  to  step  5. 

STEP  5.  Only  one  value  remains.  All  other  values  of  n yield  higher  costs,  thus  the  present 
value  of  Ni  is  the  optimal  value. 

Table  Al  displays  the  rate  at  which  values  su'e  eliminated  as  the  search  progresses. 

Table  A2  presents  the  results  of  application  of  the  search  procedure  to  the  example  problem 
presented  in  Section  7. 

TABLE  Al 


3 

0 

B 

3 

B 

5 

6 

7 

Values  eliminated  at  iteration  j 

24 

14 

10 

B 

B 

2 

1 or  2 

1 or  0 

Total  values  eliminated 

48 

52 

56 

58 

59  or  60 

i 

[ 

[ 


I 

\ 

i 

I. 

i 

L 


TABLE  A2 


Iteration 

Nt 

No 

St 

So 

So 

Kt 

Ko 

Eliminate 

0 

24 

38 

1-24 

25-37 

38-61 

2789.  90 

2181.01 

St 

1 

38 

48 

25-38 

39-47 

48-61 

2181.01 

2180.  08 

St 

2 

48 

52 

39-48 

49-51 

52-61 

2180.  08 

2241.  04 

s% 

3 

48 

42 

48-51 

43-47 

39-42 

2180.  08 

2156.  55 

St 

4 

42 

44 

39-42 

43 

44-47 

2156.  55 

2158.  88 

So 

5 

42 

40 

41 

39, 40 

2156.  55 

2162.  96 

So 

6 

42 

43 

43 

2156.  55 

2156.  76 

So 

7 

42 

41 

m 

BBI 

41 

2156.  55 

2158.  49 

Thus,  the  t/OS  is  O**=20.466  and  r**=9.1. 


otherwise  Ni=Nt~i.  Perform  Steps  2 
otherwise  Nt=Ni—6.  Perform  Steps  2 
otherwise  Ni=Ni—2.  Perform  Steps  2 
otherwise  N3=Ni—2.  Perform  Steps  2 
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A NOTE  ON  THE  SUM  OF  A LINEAR  AND  LINEAR-FRACTIONAL 

FUNCTION 


Siegfried  Schaible 

UnivertHy  of  Cologne 
Cologne,  W.  Germany 


ABSTRACT 


The  sum  of  a linear  and  linear-fractional  function  is  investigated  in  terms  of 
quasi-convexity  and  quasi-concavity.  From  this  we  obtain  some  insight  into  the 
nature  of  local  optima  of  these  functions  useful  in  algorithms. 


Consider  the  optimization  problem 

(1)  sup  {q{x)=a^x+b^x/c^x\xtS],  S^R"  convex,  c’'a:>0 

which  arises  when  a compromise  between  absolute  and  relative  terms  is  to  be  maximized  [11].  From 
a theoretical  as  well  as  algorithmic  point  of  view,  it  is  important  to  get  some  insight  into  the  nature 
of  local  optima  of  (1).  For  linear  programs  (6=0)  and  linear  fractional  programs  (a=0),  we  know 
that  (Al)  a local  maximum  is  a global  maximum,  (A2)  a local  maximum  is  attained  at  an  extreme 
point  of  S [2].  What  are  the  conditions  such  that  at  least  one  of  these  assertions  is  true  for  the  more 
general  problem  (1)?  Recall  that  (1)  can  often  be  solved  by  a convex  programming  procedure  if 
(Al)  holds,  and  a simplex-like  procedure  can  be  applied  if  (A2)  is  true  (S  polyhedral)  [5]. 

As  known  from  the  theory  of  generalized  convex  programming  [5],  (Al)  is  essentially  equivalent 
to  q(,x)  being  quasi-concave  {qcv),  and  (A2)  is  essentially  equivalent  to  g(x)  being  quasi-con  vex 
(gcx)  on  S.  We  therefore  investigate  g(x)  in  terms  of  quasi-concavity  and  quasi-convexity. 

It  is  assumed  that  o^O,  6^0,  and  6,  c are  linearly  independent.  We  need  consider  only  the 
following  cases : 

I.  a,  6,  c are  linearly  independent. 

II.  a,  b are  linearly  dependent,  i.e.  a=itb  (m^O). 

III.  a,  c are  linearly  dependent,  i.e.  a=\c  (XjxsO). 

By  an  affine  transformation  of  variables,  y=Ax,  g(x)  reduces  to 

I-  qiy)=yi+yi/yt 

II-  q(y)=m+ytly* 

III.  q(y)=\y»+yt/y% 
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Such  a transformation  does  not  affect  quasi-concavity  (quasi-convexity)  [7].  It  can  be  shown; 
PROPOSITION : We  have  in  the  above  mentioned  cases 
I : q(x)  is  neither  qev  nor  qex  on  an  n-dimensional  convex  set  S, 

II:  M>0:  q(x)  is  qcv  on  {a;eS'|a’’a:<0}  and  qex  on  {2eS|fl^a:>0},  m<0:  q(x)  is  qcv  on 
{z«S|a’’x<0,  c’’x<— m}  and  {xeiS|o^x>0,  c'^x>—n],  and  qex  on  |x«S|o’’x>0,  e^x<—n}  and 
{xtiS|a’’x<0,  c^x>— ft}, 

III : q(x)  is  qev  oa  S if  X<0,  and  qex  on  S if  X>0. 

PROOF:  I.  Let  y be  an  interior  point  of  ^(S),  and  consider  q(y)=yi+yilyz  on  the  intersection 
I of  A(S)  with  the  hyperplane  yi  = —y»+{yi+y»).  Since  the  set  {y«I|9(y)<a}  for  a =q(y)  is  not 
convex,  q(y)  is  not  qex  on  / [4].  Then  q(y)  is  not  qex  on  ^(S).  Since  —q(,x)  is  of  the  same  form  as 
q{x),  q(x)  is  not  qev  on  S. 

II.  Here  2(l/)=yj/A:(ya),  where  A:(yj)=y»/(y8+/i).  Since  A:"(yj)  = — 2fi/(yj-|-M)’,  k(yt)  is  either  con- 
cave or  convex  depending  on  the  sign  of  ft  and  yj-f-ft.  Thus,  ^(y)  is  a quotient  of  a linear  and  a 
concave  or  convex  function.  Hence,  it  is  qev  or  qex,  respectively  [4]. 

III.  Here  2(y)  = (Xy,*-|-yj)/y,.  For  a concave  (convex)  numerator  the  quotient  is  qev  (qex)  [4]. 
From  the  Proposition  we  see: 

CASE  I:  (Al)  and  (A2)  are  not  true  in  general  (for  examples,  see  Ref.  [3]). 

CASE  II,  ft>0  and  case  III : 

(2)  (Al)  is  true,  if  a^'x^O  on  S, 

(3)  (A2)  is  true,  if  a^’x^O  on  S. 

CASE  II,  ft<0:  the  assertions  (2),  (3)  hold  only  under  more  restrictive  assumptions  (see 
Proposition). 

We  see  that  for  a rather  limited  class  of  problems  (1),  local  optima  have  one  of  the  properties 
(Al),  (A2).  In  Ref.  [11]  it  was  stated  that  (2)  and  (3)  are  valid  for  all  problems  (1).  In  particular, 
the  simplex-like  procedure  in  Ref.  [11]  can  be  applied  only  under  more  restrictive  assumptions. 
If  these  are  not  fulfilled,  a method  by  Ritter  [6]  may  be  used. 

REMARK;  As  seen  above,  in  Case  II  and  III  q(x)  can  often  be  written  as  a quotient  of  a 
concave  and  convex  function.  Here,  an  extension  of  Chames-Cooper’s  variable  transformation  [1] 
relates  (1)  to  a convex  program  [8].  Then,  duality  relations  are  obtainable  for  (1)  [9,  10],  providing 
other  means  of  solving  this  problem. 
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COMMUNICATION  AND  CORRECTION 


In  my  paper,  “State-Dependent  Gap  Acceptance”  (December  1976  NRLQ),  there  is  an 
error  in  the  derivation  of  the  total  delay  distribution  on  page  654.  It  should  have  been  noted  that 
the  random  sequence  {?'<}  of  gaps  faced,  and  the  number  (Af-hl)  of  gaps  needed  for  merging, 
are  dependent  and  not  independent  stochastic  variables.  Therefore,  the  formula  for  the  delay 
CDF  Bait)  must  be  recalculated  for  this  synchronous  model  as 

Bi>(t)=Pr  {delay  D<t} 

=S  Pr  (wait  is  m gaps}-  Pr  {D<t\M=m]. 

m«0 

Given  that  it  took  (m-H  1)  gaps  for  the  merge,  we  immediately  know  that  every  one  of  the  first  m gaps 
faced  was  less  than  the  critical  gap.  Hence,  the  CDF  associated  with  each  of  these  m gaps  (say 
6(t))  is  the  underlying  gap  distribution  truncated  at  To;  so  0(t)=F(,t)/(l—p)  for  0<t<2’o. 

Thus, 

Bo{t)=±  p„0*^»Ht). 

Since  complete  independence  has  been  removed  from  the  random  sum  D«,  we  must  rederive 
an  expression  for  its  mean.  First,  we  note  from  the  revised  expression  for  Bait)  that  now 

E[D]=±  p^ (m E[T\T<To])=E[M]  E[T\T<To]. 

1)1*0 


All  other  results  in  the  paper  carry  through  with  0{t)  and  E[T\TK.T^  replacing  F{t)  and  E{T], 
respectively. 

The  major  new  formulae  are  thus 

iJo Jo 


£[rir<Tn 

p* 


and 


from  which  we  must  now  obtain 


f(rr) 

E[T\T<T^^^] 


0- 


+EIT\T<TS''], 

This  leads,  then,  to  revised  critical  gaps  {TS\  To)  in  the  illustrative  example.  There, 


(1) 

and 

(2) 


To=G-' 


r E[T\T<T^^]  1 

lp,+E[T\T<TrU 

r gtr|r<r.]  i 

Lp+£[T’|T’<r*]J' 
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But  the  gap  CDF  is  assumed  to  be 
Thus 


C.  M.  HABRI8 


and 

(3) 


Equations  (1),  (2),  and  (3)  are  then  combined  to  give  one  nonlinear  equation  for  T?  and  another 
for  To-  These  two  equations  are  individually  solved  approximately  to  find  the  final  values  of  the 
two  critical  gaps. 


Carl  M.  Harris 

Syraeum  Unvmtily 
Byraeim,  N.Y. 
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